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Foreword 


Welcome to WorldatWork Course GR2: Quantitative Methods. 


To achieve the greatest benefit from this program, your active participation is encouraged. 
Please feel free to ask questions relevant to the topics presented and share your ideas and 
experiences with the instructors and other participants. Additionally, the coffee breaks and 
lunches are excellent forums for sharing thoughts and ideas about the remuneration 


function. 


WorldatWork courses are developed through the efforts of many individuals, especially 
those who serve as WorldatWork faculty. Specifically, WorldatWork wishes to 
acknowledge the following current contributors to this course: 


James R. Beatty, Ph.D., CCP, San Diego State University 
David P. Bell, Ph.D., CCP, D. P. Associates 
Julie A. Elliott, CCP, E. W. Scripps Company 


Jason C. Kovac, CCP, CBP, WorldatWork 
Douglas A. Long, CCP, CFM, AmSouth Investment Services, Inc. 


Rebecca |. Wood, CCP 


In creating the WorldatWork educational program, we seek to offer the highest quality 
professional development opportunities for human resources professionals with 
concentrations in remuneration, benefits and total rewards. We invite your opinions and 
suggestions about the course and its content to assist us in updating our courses and to 


ensure they are meeting your needs. 


Thank you for your participation. 


Version 3.05.3 


The information in this publication, as prepared by the authors and reviewed by other experts in the field, 
represents various approaches to remuneration and benefits management. WorldatWork strives to provide 
information and perspectives that will help readers apply appropriate practices in their own organisations, 
but does not provide prescriptive advice. Nothing herein should be construed as an attempt to aid or hinder 
the adoption of any pending legislation, regulation or interpretive rule, or as legal, accounting, actuarial or 
other such professional advice. 
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Course GR2 


Course Overview 


This basic-level course is one of the CCP/CBP common core courses offered by 
WorldatWork. It focuses on general quantitative concepts, basic statistical tools, the 
mathematics of total remuneration design and administration, mathematical modelling, 
and regression analysis. Participants learn to consider data from numerous sources, with 
an emphasis on problem solving and decision making. 


Intended Audience 


This course is intended for those individuals who are relatively new to the field as well as 
experienced practitioners who seek a basic treatment for applying statistics in total 
remuneration management. Participants will benefit most from this course if they are 
proficient in the concepts covered in Course GR1 — Total Remuneration Management. It is 
assumed that participants have had no formal exposure to statistics, but have had a basic 
algebra course. 


Preparing for the Course 


Please visit the Education section of the Web site at www.worldatwork.org prior to 
attending to determine whether there is an amendment sheet published for your course 
book. Amendment sheets include minor corrections and updates (not affecting the 
examinations) that are intended to maximise the currency of the course books between 
publishing dates. The amendment sheets can be viewed and/or printed from the Web site. 


Please be sure to bring the TI BAI Plus calculator with you to the class for various 
case studies and exercises. 


Calculator 


If you have registered for the course, you will have received a TI BAII Plus calculator. 
Please refer to the pages at the end of this section that explain how to perform the major 
functions that will be covered by the class. 
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Thank You 


Thank you for choosing WorldatWork to support your professional development efforts. 

If you have any questions about the course or are missing any of the course materials 
previously mentioned, please call WorldatWork headquarters at 877/951-9191. Our mailing 
address is 14040 N. Northsight Blvd., Scottsdale, AZ 85260. 


Our Pledge to You 


WorldatWork is committed to providing you with high-quality products and services. Our 
staff at the association’s global headquarters in Scottsdale, Ariz., is dedicated to your 
satisfaction. If we do not meet your expectations — or if you wish to share any comments, 
suggestions or questions — please feel free to contact us at any time. 

We appreciate your support. 


WorldatWork 

Toll-free 877/951-9191 

Fax 480/483-8352 
customerrelations@worldatwork.org 
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Overview: Course GR2 


Module 1: Statistics — Data, Information and Levels of Measurement introduces 
basic quantitative concepts and the four levels of measurement. 


m Why HR professionals collect and use data 
m™ Five key questions to ask about the variable of interest 
m™@ Levels of measurement 


Module 2: Percents and Related Issues defines and compares percents, market 
index and compa-ratios. 


Percents 

Individual compa-ratio 
Department compa-ratio 

Market index 

Percent difference 

Developing salary ranges 
Percents in benefits management 


Module 3: Time Value of Money addresses the time value of money, including 
compound interest and the compound salary growth rate. 


Time value of money 
Compound interest and compound salary growth rate 


Constant midpoint progression 


Annuity payments 


Module 4: Statistics — Collecting, Organising, Grouping and Displaying Data 
introduces statistical analysis. 


m Populations and samples 
m@ Frequency distributions 
— Organise, group and display data 
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OVERVIEW: COURSE GR2 


Module 5: Statistics — Lying with Statistics, Graphs and Displays explains how to 
accurately display data. 


™ Recognising distorted data 
m Mistakes that distort data 


Module 6: Statistics - Measures of Central Tendency and/or Location addresses 
measures of central tendency and measures of central location. 


™ Measures of central tendency 
@ Measures of location 
@ Percentile bars 


Module 7: Measures of Variability discusses measures of variability. 


™ Measures of variability 
— Range 
—  Interquartile range 
— Standard deviation 
™ z-scores 


Module 8: Statistics - Shapes of Distributions addresses shapes of distributions. 


m™ Interpreting distributions 
@ Normal distribution 


Module 9: Regression Analysis introduces modelling and regression analysis. 


Regression models in an HR environment 
Developing a regression model 
Cautions in the interpretation of correlations 


Multiple regression 


Appendix 


™@ Quiz answers 
™ Support materials for course modules 
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OVERVIEW: COURSE GR2 


Resources 


@ Related articles 
@ Sources for additional information 


Glossary 


™ Definitions of terms applicable to this course 


Notes 


@ Blank note pages 


Faculty 


m@ List of selected faculty for this course 


Exams 


@ Information on WorldatWork Society of Certification Professionals certification 
exams 
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How to Succeed 


Become familiar with the calculator. 
Do the exercises and cases presented during the course. 
Practice what you learn. 


Ask questions. 


Review the glossary of terms. 
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Quick Tutorial — The Calculator 


The numbers you will be asked to enter into the calculator are simply example numbers 
that do not correspond to a problem. The purpose is to have you get familiar with the 
calculator. 


This calculator has two modes of operation: the standard calculator mode and the 
prompted worksheet mode. 


The standard calculator mode is designed for basic arithmetic functions and to calculate 
financial values such as present value, future value and compound interest rate. 

The prompted worksheet mode guides you through more advanced calculations such as 
the mean, standard deviation, variance, intercept, slope, correlation and coefficient of 
determination. 

Most keys on the calculator have two functions. These special key functions are indicated 
by notations above the keys. For example, the “0” key has a second function of “Mem” 
(memory). The 0 is the first function of that key, and the Mem (memory) is the second 


function of that key. To use the first function, press the key. To use the second function, 
press the “2nd” key (located on the upper left-hand corner of the keyboard) and the desired 


function key. 
In this brief calculator tutorial, we will use a rectangle] | to indicate a first function 


and an oval Cc > to indicate a second function. 


Before we start, we will arbitrarily select two decimal places to appear on the display. 
Perform this task as follows: 


Note: Not all calculations will require two decimal places. The format key can be used to 
enter any single-digit number of places. 
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The keys we will use to calculate certain statistical values are shown below: 
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Mean, Standard Deviation 
and Variance 


These pages illustrate keystroke sequences used to calculate various statistics. The 
numbers are not actual problem data. 


Procedure Press Display 


1. Select data-entry portion 2nd Data X01 = (Old contents) 
of statistics worksheet. 


2. Clear worksheet. 2nd X01 = 0.00 


3. Enter data. 
a) Enter 1st data point. 2000 Enter X01 = 2,000.00 


Move down 2 displays. Vv Y X02 = 0.00 


Move down 2 displays. X03 = 0.00 


c) Enter 3rd data point. 2300 Enter X03 = 2,300.00 


Move down 1 display. + Y03 = 1.00 
This shows the number 
of entries in the 3rd field. 


d) Enter the number Y03 = 2.00 


of times 2300 occurs. 


Move down to the next 


display. 


e) Enter 4th data point. 2500 Enter X04 = 2,500.00 


4. Select statistical calculation (Old contents) 


portion of statistics 
worksheet. 
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Mean, Standard Deviation 
and Variance (Cont'd) 


Procedure Press Display 


5. Clear worksheet. 2nd Cir Work LIN 


6. Select one-variable 2nd 1-V 


calculation method. (4 times) 


7. View calculations. 


2,260.00 


€ 

€ 

x! 
TT 


a) mean 


8. If the data represent a sample 


a) standard deviation SX = 181.66 
b) variance SX = 33,000.00 


9. If the data represent a population 


a) standard deviation v oX = 162.48 
b) variance x? oX = 26,400.00 
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Data Worksheet 


Procedure 


1. Select data-entry portion 


of statistics worksheet. 
Clear worksheet. 

Enter the pairs of data. 
a) Enter the first values 


(beginning with the 
x-value). 


Move to next display. 
Enter the first y-value. 
Move to the next display. 
c) Enter the 2nd x-value. 
Move to next display. 
d) Enter the 2nd y-value. 
Move to next display. 
e) Enter the 3rd x-value. 
Move to next display. 
f) Enter the 3rd y-value. 
Move to next display. 
Enter the 4th x-value. 
Move to next display. 
h) Enter the 4th y-value. 


Move to next display. 


Press Display 
2nd Data X01 (Old contents) 
2nd X01 = 0.00 
400 Enter X01 = 400.00 
+ YO1 = 1.00 


X02 = 0.00 


500.00 


[wv | 


500 Enter X02 = 


v Y02 = 1.00 


X03= 0.00 


X03 = 600.00 


600 Enter 


v Y03 = 1.00 


X04 = 0.00 


X04 = 700.00 


700 Enter 


v Y04= 1.00 
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Intercept, Slope, Correlation 
and Coefficient of Determination 
in Linear Regression 


Procedure Press Display 


4. Select statistical calculation 2nd Stat (Old contents) 
portion of statistics worksheet. 


5. Clear worksheet. 2nd LIN 


6. To view the computations. Press vy 7 times oy 353.55 


a 2,850.00 


a) Compute intercept a. 


r 0.95 


b) Compute slope b. vo b 3.00 
c) Compute correlation r. 


d) Compute coefficient x a 0.90 


of determination. 
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Calculation of Future Value — 
Compute Using Percent Change/ 
Compound Interest Worksheet 


Example: 


Jim and Mary would like to take their two children on a vacation in four years. The travel 
agent advised them that the trip would cost approximately 15,000. Mary has checked 
around and found a bank that will pay 6% annual interest, compounded quarterly. How 
much must Mary put in the bank now to have 15,000 at the end of four years? 


Procedure Press Display 


1. Select percent change/ 2nd A% OLD = (Old contents) 
compound interest worksheet. 


3. Move to (new) future value display. y NEW = 0.00 

4. Enter value (15000). 

5. Move to interest (%CH) display. y %CH = 0.00 

6. Enter interest (%CH) (6/4 = 1.5). 

7. Move to number of periods (#PD) & #PD = 1.00 
display. 

8. Enter total number of periods 
(4x 4 = 16). 

9. Move to (old) present value display. y OLD = 0.00 


10. Compute (old) present value. OLD = 11,820.47* 


Answer: 11,820.47 must be placed in the bank now to have 15,000 at the end of four 


years at 6% annual interest, compounded quarterly. 


GR2 # 0.15 


© WorldatWork. All rights reserved. 


Basic Functions of the Calculator 


Procedure Press Display 


To enter statistical data 2nd Data X01 = 0.00 


To choose a statistics calculator LIN, Ln, EXP, 
PWR or 1-V 


method and compute results 


To clear a worksheet 2nd = Clr Work First variable 
in the worksheet 


To return to standard calculation 


mode from a worksheet 


To fix calculations to 2 2nd Format 2 Enter DEC = 2.00 
decimal points 


To clear incorrect entry CE/C 


To clear a calculation CEIC CE/G 0.00 
started but not completed 


To recall a value in Memory 3 RCL 3 3.50 
To display results in Memory 3 RCL 3 10.00 
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Using the Statistics Worksheet 


You use two key sequences when performing statistics calculations. To 
enter statistical data, press [Data]. To choose a statistics calculation 
method and compute results, press [Stat]. 


Statistics Worksheet Keys and Labels 


Key Type of 
Sequence Label Meaning Variable 
2nd] [Data] Xnn* Current X value Enter-only 
a) Ynn* Current Y value Enter-only 
[Stat] LIN Standard linear regression Setting 
2nd) [SET] Ln Logarithmic regression Setting 
EXP Exponential regression Setting 
PWR Power regression Setting 
1-V One-variable statistics Setting 
n Number of observations Auto-compute 
(as x Mean (average) of X values Auto-compute 
needed) Sx Sample standard deviation of X Auto-compute 
ox Population standard deviation | Auto-compute 
of X 
vy Mean (average) of Y values Auto-compute 
Sy** Sample standard deviation of Y Auto-compute 
oy** Population standard deviation | Auto-compute 
of Y 
a** Linear regression y-intercept Auto-compute 
b** Linear regression slope Auto-compute 
r** Correlation coefficient Auto-compute 
x"* Predicted X value Enter/compute 
Ye" Predicted Y value Enter/compute 
=X Sum of X values Auto-compute 
=X2 Sum of X squared values Auto-compute 
“yY** Sum of Y values Auto-compute 
L¥2** Sum of Y squared values Auto-compute 
=xY** Sum of XY products Auto-compute 


* nn indicates the number of the current X or Y value. 


** Not displayed for one-variable statistics. 


Reproduced from Texas Instruments BAII Plus calculator user guide located online at: 


http://support.education.ti.com/SRVS/CGI-BIN/WEBCGI.EXE/,/?St=80,E=0000000000003727911, 


K=3305,Sxi=4,Case=obj(3654) 
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Using the Other Worksheets 


The Percent Change/Compound Interest Worksheet 


To access the Percent Change/Compound Interest worksheet, press 
[A%]. You can also compute compound interest or perform cost-sell- 
markup calculations. 


Percent Change/Compound Interest Worksheet Labels 


Label Meaning Type of Variable 
OLD Old value Enter/compute 
NEW New value Enter/compute 
“CH Percent change Enter/compute 
#PD Number of periods Enter/compute 


Notes on the Percent Change/Compound Interest Worksheet 
[CLR Work] sets OLD, NEW, and %CH to zero and sets #PD to 1. 
[Reset] sets OLD, NEW, and %CH to zero and sets #PD to 1. 


For percent change calculations, you can enter values for two of the three 
variables (OLD, NEW, and %CH) and compute a value for the unknown 
variable. (Leave the value for #PD as 1.) A positive percent change 
represents a percentage increase, and a negative percent change represents 
a percentage decrease. 


For compound interest calculations, you can enter values for three of the 
four variables and compute a value for the unknown variable. OLD = 
present value, NEW = future value, %CH = interest rate per period, and #PD 
= number of periods. 


For cost-sell-markup calculations, you can enter values for two of the three 
variables (OLD, NEW, and %CH) and compute a value for the unknown 
variable. OLD = cost, NEW = selling price, and %CH = percent markup. 
(Leave the value for #PD as 1.) 


Reproduced from Texas Instruments BAII Plus calculator user guide located online at: 


http://support.education.ti.com/SRVS/CGI-BIN/WEBCGI.EXE/,/?St=80,E=0000000000003727911, 
K=3305,Sxi=4,Case=obj(3654) 
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Module 1 


Statistics: 
Data, Information and 
Levels of Measurement 


WorldatWork B 
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Module 1: Statistics: Data, Information 
and Levels of Measurement 


Introduction 


Human resources professionals collect and use data to improve chances of making sound 
decisions. Salary survey data are used to determine the appropriate value for positions in 
an organisation. Deductibles and other benefits costs are analysed when deciding the 
health care benefits an organisation will sponsor. 


In this module, we will set the foundation by reviewing the four levels of measurement and 
providing examples of each level. 


Objectives 


By the conclusion of this module, you will be able to accomplish the following: 
1. Discuss why HR professionals collect and use data. 
2. Identify five key questions to ask about the variable of interest. 


3. Identify the levels of measurement. 
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Why Do HR Professionals Collect 
and Use Data/ Information? 


@ Finance and planning 


m@ Budgeting 


Why Do HR Professionals Collect and Use Data/Information? 


Data (information) are collected and used to improve chances of making sound 
decisions. Under the broad human resources umbrella, quantitative analysis is used 
as a business tool in the following ways: 


m@ Finance and planning 


— Asastrategic business partner, HR must be aware of how design and 
implementation of programs affects the overall company financial statements. 


= Budgeting 


— HR professionals must be able to read and interpret organisational/departmental 
budgets, as program costs to an organisation can be significant. 


— The HR department has responsibility for managing the human capital budget. By 
analysing data/information, they can make effective, empirically sound decisions. 
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Why Do HR Professionals Collect 
and Use Data/Information? (Cont'd) 
# Investment management 


— Interest rates, earning/ purchasing 
power, risk and return factors 


™ Economic conditions 
— Share price volatility 
— Health care costs 


— Interest rates 


Why Do HR Professionals Collect and Use Data/Information? (Cont'd) 


m= Investment management 


— Interest rates, earning/purchasing power, risk and return factors — HR 
professionals should have a general knowledge of interest rates, earning 
power/purchasing power and risk and return factors that will assist in decision 
making for investment policy and monitoring. 


™ Economic conditions — Organisation objectives and available resources will be 
affected by current economic conditions. 


— Share price volatility 
— Health care costs 


— Interest rates 
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Why Do HR Professionals Collect 
and Use Data/Information? (Cont'd) 


= Metrics 


Monitor and measure performance goals 


— Measure return on investment 


| 


Incentive plans and payouts 


Measure outcomes of objectives 


@ Modelling 


Why Do HR Professionals Collect and Use Data/Information? (Cont'd) 


= Metrics 


— Monitor and measure performance goals — Metrics provide a quantifiable 
means of monitoring and measuring key performance goals; organisations use 
them in identifying, tracking and correcting problems as they occur. 


— Measure return on investment — Metrics can be used to measure ROI for 
proposed and existing programs. 


— Incentive plans and payouts — Metrics can be used in developing incentive/ 
commission plans and in determining payouts. 


— Measure outcomes of objectives — Metrics can be used to evaluate the 
potential success or failure of outcomes relative to an organisation's objectives. 


= Modelling 


— Modelling provides a way to test data in various scenarios in order to observe 
potential outcomes. Models may offer a clearer picture of what may occur given 
different circumstances. 
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WHY HR PROFESSIONALS COLLECT DATA 


Examples — 
Remuneration 
Applications 


m Sign-on bonuses 

m@ Pay adjustments 

@ Salary survey data 

@ Pay-for-performance programs 
= Commissions 

@ Incentives 


Examples — Remuneration Applications 
m Sign-on bonuses — implementation of sign-on bonuses 


m@ Pay adjustments — examining the consequences of adjusting pay now versus 
adjusting pay later, along with cost implications 


Salary survey data — using salary survey data to price jobs based on the market 
Pay-for-performance programs — developing pay-for-performance programs 


Commissions 


Incentives 
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WHY HR PROFESSIONALS COLLECT DATA 


. ay Examples — 
Se, Benefits Applications 


= Cost/benefit analyses 

m@ Participation levels 

™@ Workforce demographics 
m@ Plan effectiveness 


= Benefit comparisons 


Examples — Benefits Applications 


™ Cost/benefit analyses — conducting cost/benefit analyses of health and welfare 
programs 


m Participation levels — examining participation levels of employees in various benefits 
programs 


™ Workforce demographics — determining workforce demographics for anticipating 
present and future benefits needs 


Plan effectiveness — comparing benefit plan effectiveness between organisations 


Benefit comparisons — weighing benefits of a paid time off program versus vacation, 
sick and personal holiday plan 
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WHY HR PROFESSIONALS COLLECT DATA 


Examples — 
Work Experience 
Applications 


= Cost of implementing 
compressed workweek 


m@ Savings associated 
with telecommuting 


™ Measuring productivity levels 


™ Measuring effectiveness 
of job sharing 


Examples — Work Experience Applications 


Cost of implementing compressed workweek — determining the cost of adding a 
compressed workweek schedule 


Savings associated with telecommuting — monitoring savings associated with 
telecommuting 


Measuring productivity levels — measuring the productivity levels resulting from 
flexible workweeks 


Job sharing — measuring the effectiveness of job-sharing plans 
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Five Key Questions 
About the Variable of Interest 


1. What is the variable of interest? 


2. What is the level of measurement? 


3. Do the data represent a population 
or a sample? 


4. Are the data suitable for grouping? 


Are there any other variables 
available or under consideration? 


Five Key Questions About the Variable of Interest 


In this seminar, we will consistently address five key questions about our variable of 
interest. In this module, these questions are simply mentioned without much explanation, 
as we are laying the groundwork. Each of these questions will be discussed in detail as we 
progress through the seminar. 
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Levels of Measurement 


@ Nominal 
@ Ordinal 
@ Interval 


H Ratio 


Levels of Measurement 


As you progress through the forms of measurement, each measure takes on an 
additional element of significance. 


= Nominal 

— Information that can only be used to categorise 
= Ordinal 

— Data that can be placed in a rank order 
@ Interval 


— While the zero point is arbitrary, equal and meaningful intervals between points 
on a scale exist and can be determined. 


= Ratio 


— With ratio scales, a true zero point exists; ratio measurements are meaningful and 
indicate how much of the quantity is being measured. 
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LEVELS OF MEASUREMENT 


Examples — 
Nominal Measurement 


_ Baseball — 1 

Uniform > | | = 
_ Numbers oe oc. 
bo / _ Classification 
| I : Salaried | 
lL _. _| oS eee i 


ed 


Examples — Nominal Measurement 
= Nominal measures include information that categorizes, such as job descriptions. 


m What are some other examples of nominal measurement that you work with? 
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LEVELS OF MEASUREMENT 


Examples — 
Ordinal Measurement 


Olympic Medals 


Silver Bronze 


Health Care Options 
1. High deductible 
2. Medium deductible 
3. Low deductible 


Performance Ratings 
_ Distinguished 
Above standard 
Standard 

2 | Below standard 
El Unsatisfactory 


Examples — Ordinal Measurement 


Ordinal measurements are used to rank order information. Three common examples 
are shown above. 


Other examples: 
™ Class standing 
1. Senior 
2. Junior 
3. Sophomore 
4. Freshman 
m@ Attitudes toward training programs 
The same concept can be depicted verbally or visually: 


5. Very Positive 


- Very Very 
4. Positive Negative Neutral Positive 
3. Neutral O———OO0O——Oo—__O——_O 
2. Negative 
1. Very Negative 
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LEVELS OF MEASUREMENT 


Characteristics of 
Interval Measurement 
@ Equal intervals 
@ No true zero 
@ Zero point arbitrary 


m Differences have meaning 


Characteristics of Interval Measurement 


Equal intervals — There are equal intervals between the points being analysed. 
No true zero — There are no true zero points on the scales. 
Zero point arbitrary — Assignment of a zero point is arbitrary. 


Differences have meaning — Differences along each scale have meaning. 
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LEVELS OF MEASUREMENT 


Examples — 
Interval Measurement 


TEMPERATURE 


Point 


— 212 € Bolling 5 — 199 
hs 


— 50 


50 
Freezing } 
32 € point > 0 


Fahrenheit Celsius 
(Centigrade) 


Examples — Interval Measurement 
A good example of an interval scale is the Fahrenheit scale for temperature. 


Calendars are another example of interval measurement: the Chinese, Christian and 
Hebrew calendars illustrate the arbitrary zero point that is characteristic of interval 
measurement. 
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LEVELS OF MEASUREMENT 


Characteristics 
of Ratio Measurement 


m@ Ratios make sense 
m@ Measurements indicate “how much” 
@ True zero points exist 


@ Ratios have meaning 


Characteristics of Ratio Measurement 


Ratios make sense — Ratios from one point to the next make sense. 


Measurements indicate “how much” — Ratio measurements indicate “how much.” 


True zero points exist. 


Ratios have meaning. 
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LEVELS OF MEASUREMENT 


Examples — 
Ratio Measurement 


Money (amount earned) 


Number of employees 


Number of parts produced 
Number of defects 
Turnover 


Health care premiums 


Pension benefits 
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LEVELS OF MEASUREMENT 


Data Matrix 


(Variation ratio*) 


# Only comparative statements 
regarding “same as” or 
“different from” can be made. 


Nominal 


(Qualitative data; 
weak measurements) 


m Range 
# |nterquartile range 


m Median 
# Other percentile points 
# Quartile points 

# Decile points 


# Rank ordering is possible. 
» Comparative statements such 
as “greater than,” “equal to,” 

or “less than” can be made. 


Ordinal 


(Qualitative data; 
weak measurements) 


(Variance) 
s Standard deviation 


w Mean: 
Weighted mean 

Unweighted mean 

~- (Trimmed mean*) 


w An arbitrary zero exists. 
# Equal intervals from point to 
point can be established. 


interval 


(Quantitative data; 
strong measurements) 


w All of the above have 


Ratio # A true, absolute zero exists. # All of the above have 
(Quantitative data; # Ratio comparisons can be meaning. meaning. 
strong measurements) made. All the above 
statements are satisfied. 
Module 1 Module 6 Module 7 


* Not discussed in this seminar 


We will refer to the data matrix throughout the rest of the course. So far, we understand 
that nominal and ordinal data are weaker levels of measurement, and therefore we will be 
able to apply only a few statistical tools. Interval and ratio are stronger levels of 
measurement, allowing us to apply more advanced statistical tools. 
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Levels of Measurement 


Determine the level of measurement for each of the following items. Place N next to each 
nominal measurement, O next to each ordinal measurement, | next to each interval 
measurement and R next to each ratio measurement. 


Religious classification (Protestant, Catholic, Moslem, Jewish, Other, None) 

A list of temperatures in European capitals, based on Celsius temperature scale 
College class standing (freshman, sophomore, junior, or senior) 

A list of the ages of your senior managers 


Political party affiliations (e.g., Republican, Democrat, Libertarian, 
Independence, Green, Peace and Freedom, etc.) 


Heights of machinists 

Rank order of the heights of a group of machinists 

Weights of suitcases stored on an airplane 

Job performance as measured by the number of parts produced 
Job performance as measured by a rating scale 

Gender (male or female) 

Months of experience in remuneration and/or benefits 

Number of previous college-level statistics courses completed 
Ideal room temperature (in Fahrenheit) 


Anxiety level toward GR2 on a 10-point scale, with 1 representing very 
comfortable and 10 representing very anxious 


Number of employees in the HR department 


Number of employees serviced by the HR department 
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Can You... 


Discuss why HR professionals 
collect and use data? 


Identify five key questions to ask 
about the variable of interest? 


Identify the levels of measurement? 
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Module 1 Quiz 


1. Why do HR professionals collect and use data? 
A. To support personal agendas 
To respond to share price volatility 


To improve chances of making sound decisions 


90 B 


To increase health care costs 


A true, absolute zero exists for which level of measurement? 
A. Nominal 

B. Ordinal 

C. Interval 

D. Ratio 


3. Salary is an example of which level of measurement? 
A. Nominal 
B. Ordinal 
C. Interval 
D. Ratio 


4. In order to satisfy ordinal level data requirements, you must be able to do which of 
the following? 


A. Have a true zero 

B. Have a constant interval from point to point 

C. Be able to meaningfully arrange the data in rank order 
D 


. Be able to establish ratios 


5. Rating employee performance on a five-point scale ranging from “5” (outstanding) 
to “1” (unsatisfactory) is most likely an example of which level of measurement? 


A. Nominal 
B. Ordinal 
C. Interval 
D. Ratio 
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Module 2 


Percents and 
Related Issues 


WorldatWork B 
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Module 2: Percents and Related Issues 


Introduction 


A number of aspects of remuneration, benefits and total rewards rely on the 
calculation of percents. However, percents themselves do not tell us anything about 
absolutes; they merely report relative relationships. In this module we will discuss a 
range of topics — first defining the concept of percents, then illustrating and defining 
some commonly used ratios in human resources. 


Objectives 


By the conclusion of this module, you will be able to accomplish the following: 
1. Define and calculate percents. 
2. Apply percents and ratios to examine internal and external equity, including: 
gm = Individual compa-ratio 
= Department compa-ratios 


Market index 


= Percent difference 


3. Develop salary ranges. 


4. Identify common examples of the use of percents in benefits management. 
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Percents 


#@ What is a percent? 
H How is it calculated? 


m@ Why do we convert to percents? 


Percents 


Percents are used often in everyday life. In the HR profession, percents are very valuable 
tools. 


m@ What is a percent? 
— A tool for comparing one value to another 
— A ratio, expressed in terms of 100 
@ How is it calculated? 
— Divide one value into another. 
— Multiply this ratio by 100. 
m Why do we convert to percents? 


— To have a common base from which we can make comparisons — 
a base of 100 


The value we wish to compare against 


is placed in the denominator. 
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PERCENTS 


The Calculator 
Layout 


ra] 
se 


Nes 
Second 
function 


ny 


BEC 


OOOO BOB 
ADCODO BEE 


POD CORO E 


Feenae 


NS 


The Calculator Layout 


The “2nd” and “Format” keys allow you to change the number of decimal places. 
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PERCENTS 


Example — Percent in a Group 


m 50 employees total 

m@ 40 are women 

= What percent are women? 
Answer: 


(+) x 100 = 0.80 x 100 = 80% are women 


Example — Percent in a Group 
We have 50 employees in total, 40 of whom are women. What percent of our employees 
are women? 


We want to compare the number of women in our company against the total number of 
employees. Thus, the total (50) is entered as the denominator. 


What percent are men? 


10/50 or 20% 
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PERCENTS 


A Voluntary Savings Plan 


m@ Steve Yang earns 50,000. 


@ He contributes 4% of his gross 
income to a voluntary savings plan. 


= What is his total contribution to the 
voluntary savings plan? 


A Voluntary Savings Plan 

An employee earning 50,000 contributes 4% of his or her salary to the voluntary savings 
plan. What is the total contribution for the employee? 

Answer: 

50,000 x 4% = 2,000 

If there is a company match of 50%, what will the total contribution be? 

Answer: 

50,000 x 4% = 2,000 

+ 1,000 = 3,000 


Note: Voluntary savings plans are retirement plans typically funded by both employee and employer contributions. 
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PERCENTS 


A Voluntary Savings Plan 
Expanded to Two Employees 


Makayla 
Johnson 


Steve 
Yang 


Employee Salary 


Employee Contribution 
(Percent) 


Employee Contribution 


Taxable Income 
to Employee 


A Voluntary Savings Plan Expanded to Two Employees 


As shown above, Steve Yang earns 50,000 per year and contributes 4% of his annual 
salary to the voluntary savings plan. The 4% equates to 2,000/year, which then would 
decrease his taxable income by 2,000 to 48,000. 


The second employee, Makayla Johnson, earns 20,000/year and contributes 8% to the 
voluntary savings plan. The 8% equates to 1,600/year, which decreases her taxable 
income by 1,600 to 18,400. 
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PERCENTS 


What Is a Percent Increase? 


@ Salary budget this year: 73,327 
™@ Salary budget next year: 81,106 
= What is the percent increase? 


Calculation Method: 
81,106 — 73,327 
teal 


7,779 
73,327 


= 0.1061 x 100 
= 10.6% (rounded) 


% increase = x 100 


What Is a Percent Increase? 


Your salary budget for this year is 73,327. You plan to have a salary budget of 81,106 for 
next year. What percent budget increase will you have? 


Determine the difference between next year’s budget (81,106) and this year’s budget 
(73,327): 


81,106 — 73,327 = 7,779 
Divide the difference (7,779) by this year’s budget: 


7,179 
73,327 


= 0.1061 or 10.6% increase 
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Applying Percents and 
Ratios to Examine Internal 
and External Equity 


& Individual compa-ratios (CRs) 
m@ Department compa-ratios 

= Market indices 

@ Examining relationship 


@ Percent difference 


Applying Percents and Ratios to Examine Internal and External Equity 


In addition to voluntary savings and budget increases, HR practitioners use percents and 
ratios quite frequently. Regarding remuneration, several examples would be: 


Individual compa-ratios (CRs) — computing individual compa-ratios 


Department compa-ratios — computing department compa-ratios 


Market indices — computing market indices 


Examining relationship — examining the relationships among: 
— Internal equity (compa-ratios) 
— External equity (market indices) 


= Percent difference — calculating percent difference 
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APPLYING PERCENTS AND RATIOS 


Individual Compa-ratio 


What is an individual compa-ratio? 


coeenne 


Salary 


Individual Compa-ratio = —————_ 
Midpoint 


4 
; 
5S SS call 


Individual Compa-ratio 


Individual compa-ratio is a tool that helps compare an individual’s actual pay to the 
midpoint of the pay structure. 
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APPLYING PERCENTS AND RATIOS 


Individual Compa-ratio (Cont'd) 


= Employee salary: 36,000 
= Midpoint: 40,000 


@ Individual compa-ratio? 


Individual _ Salary _ 36,000 _ 9 o9 | 
Compa-ratio Midpoint 40,000 — 


SE 


Individual Compa-ratio (Cont'd) 


An employee’s salary is 36,000, while the midpoint of the salary structure for that 
person’s pay grade is 40,000. What is the individual compa-ratio for that person? 


Note: In practice, a compa-ratio is expressed as a decimal (.90), as a percent (90%), or as a whole number (90). 
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APPLYING PERCENTS AND RATIOS 


Department Compa-ratio 


individual 
CR 


Employee Base Midpoint 
Freddie 36,000 40,000 90% 


Carri 110,000 | 100,000 110% 


m@ Average of 90% and 110% is 100% 


= However, total salaries /total midpoints 
is 146,000/140,000 = 104.3% 


Department Compa-ratio 
Why is total salaries/total midpoints correct? 


Answer: 


The first employee is under the midpoint by 4,000, while the second employee is over the 
midpoint by 10,000. In total, pay is above the midpoints. 
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APPLYING PERCENTS AND RATIOS 


Market Index 


What is a market index? 


A commonly used ratio that compares 
a salary to a market average. 


| 


iS SEALS RSENS SSE 


Market Index 


“Market index” is an external measure. It is a ratio that compares internal salaries to the 
external market salary average. It is expressed in terms of a particular salary for a job, 
relative to the market salary for that same job. 


“Market” refers to the data from salary surveys. Some companies use the market average, 
others use the median (50* percentile) or other measures of location. 
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APPLYING PERCENTS AND RATIOS 


Market Index (Cont’d) 
@ An individual makes 28,000 per year 


m@ Market: 35,000 
#@ What is the market index? 


SPREE 55 REESE EEE EEE ER ESS 
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APPLYING PERCENTS AND RATIOS 


Examining Relationship: 
Poipere™ vs. Market Index 


i 


Market ———s cup 
Index — PAY K _ ratio | 


Son Wis : tore) 


ie MARKET ac [ MIDPOINT a 


"Sears 


Examining Relationship: Compa-ratio vs. Market Index 
What is the difference between a compa-ratio and a market index? 


A compa-ratio compares salaries to structure midpoints, yielding an internal index. 
A market index compares salaries to market salaries, yielding an external index. 


The link between the internal and external is the pay structure policy. Where do you set 
your midpoints to the market? 
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APPLYING PERCENTS AND RATIOS 


Percent Difference 


= Market is 44,000 and your 
company’s midpoint is 40,000. 


m@ Find the percent difference. 


Approach 1: Approach 2: 
44,000-— 40,000 = 4,000 = 44,000- 40,000 = 4,000 
Difference _ 4,000 Difference _ 4,000 
Grade Midpoint 40,000 Market 44,000 
= 10% = 9.09% 
The midpoint needs The market must 
to be increased by 10% decrease by 9.09% 
to match the market. | to match the midpoint. 


Percent Difference 
What is a percent difference? 
Percent difference is a tool for comparing two items. 


There are two approaches to percent difference. 


Example 


Suppose your survey data indicate that the market for positions in grade 12 is 44,000. 
Your company’s current midpoint for grade 12 is 40,000. 


How do you determine the appropriate percent difference in order for your 
company’s grade 12 midpoint to equal the market? 


Both approaches shown are technically correct, 


but Approach 1 is preferable. 
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APPLYING PERCENTS AND RATIOS 


Why Is Approach 1 Preferable? 


The data over which you have control 


should be the denominator 
of the calculation. 


The percents differ 
based on the denominator. 


Why Is Approach 1 Preferable? 
m@ Rule 


— The data over which you have control should be the denominator of the 
calculation. 


m™ Inthe previous example, since you have control over the salary midpoint (whereas you 
do not have control over the market), using the salary midpoint is the preferred method. 
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APPLYING PERCENTS AND RATIOS 


Range Penetration 


Pay — Minimum 
Maximum — Minimum 


* aaa EEE 8 


If range penetration i is: Then pay is: 


At the minimum of the pay structure 
100% At the maximum 


Range Penetration 


The range penetration is an alternative approach to the compa-ratio. 
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APPLYING PERCENTS AND RATIOS 


Range Penetration Chart 


Minimum Maximum 


Mike - Service: 9 Rating: 
Natalia - Service: 15 Rating: 


Sophia - Service: 3 Rating: 


Christy - Service: 1 Rating: 
Stephanie - Service: 5 Rating: 85% 
Alex - Service: 7 Rating: 
Erin - Service: 2 Rating: 


Michael — Service: 6 Rating: 


0% 33% 67% 100% 
Still progressing Fully competent Exceptional 
Limited experience Fully capable Consistently exceeds 
New to job Moderate experience _ Significant experience 
Recently promoted Solid contributor Ready for promotion 


Range Penetration Chart 

In this example, the pay range is divided into thirds. Each employee's position within the 
pay range is displayed by the horizontal bars. The labels on the left indicate the 
employee's name, service and performance rating. 


This graph is a useful tool to help managers allocate their merit increase budget. For 
example, the manager may want to give a larger than average increase to Michael, who 
has a high performance rating yet is low in the range. On the other hand, the manager may 
decide to give no increase or a small increase to Stephanie who has a low performance 


rating yet is high in the pay range. 
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Developing Salary Ranges 


Minimum Midpoint Maximum 


Market 


Range 


Developing Salary Ranges 


Elements required for developing a salary range: 


Determine the range (minimum to maximum). 
Determine the range spread (percent difference between minimum and maximum). 
Determine the midpoint. 


Compute the range from the range spread and midpoint. 


Pay structures are administrative tools designed to be internally fair, externally 
competitive and cost effective — all in support of the organisation’s remuneration 
strategy. Pay structures typically group jobs of equal internal value or worth into 
grades with competitive salary range (i.e., minimum to maximum dollar value). 
Jobs are assigned to grades and salary ranges based on job content and market 
value, and typically are not assigned based on incumbent value. 
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DEVELOPING SALARY RANGES 


Calculate the Range Spread 


Given the minimum and maximum, 
determine the range spread. 


Maximum — Minimum 


Range Spread = — 
Minimum 


| 
x 100 | 
i 


MT TTT TT TT TTT 


Calculate the Range Spread 
Given the minimum and maximum, determine the range spread. 


The range spread is defined as the percent the maximum exceeds the minimum, relative to 
the minimum. 


Expressed in mathematical terms: 


Maximum — Minimum 
Minimum 


x 100 


Range Spread = 
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DEVELOPING SALARY RANGES 


Calculate the Range Spread (Cont'd) 


You have a 27,000 per year maximum and 
an 18,000 per year minimum. 


The range spread is calculated as follows: 


| 27,000 — 18,000 _ 199 — -2:000 .. 100 = 0.5x 100 = 50% | 
18,000 18,000 | 


| 


Calculate the Range Spread (Cont'd) 


For this example, the minimum has been predetermined at 18,000 and the maximum has 
been predetermined at 27,000. 
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DEVELOPING SALARY RANGES 


Determine the Midpoint 


The midpoint is 22,500: 


Determine the Midpoint 


In this example, the midpoint is 22,500 (calculated as halfway between the minimum and 
maximum). 


Example: 


Maximum — Minimum + Minimum = Midpoint 


27,000 — 18,000 


5 + Minimum = Midpoint 


a“ + 18,000 = Midpoint 


4,500 + 18,000 = Midpoint 


22,500 = Midpoint 
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DEVELOPING SALARY RANGES 


Reality Testing 


Maximum 27,000 


Midpoint 22,500 50% 


Minimum 18,000 


(27,000 — 22,500) + 22,500 = 0.20, for a 20% range spread 
and: 
(22,500 — 18,000) + 18,000 = 0.25, for a 25% range spread 
yet: 
The grade range spread is symmetric, 
with a 50% range spread 


Reality Testing 


While the results of the above calculations may seem inconsistent, remember that we are 
using different denominators in the two separate calculations. 


Suppose you want a 50% range spread. Will the following calculation yield a 50% range 
spread? 

™ Take 75% of the midpoint to get a minimum of 16,875. 

m Take 125% of the midpoint to get a maximum of 28,125. 

Answer: 

The calculations above actually yield a total range spread of 66.67% — not 50%. 
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DEVELOPING SALARY RANGES 


— 


Compute the Minimum and 
Maximum When the Midpoint 
and Range Are Known 


a Midpoint 
Minimum = —————_"— 
a [Pestree Range Spree) 


= 
ShnreesneesanenaanicnenionteneanenesnaenienmeamarR 


Compute the Minimum and Maximum 
When the Midpoint and Range Are Known 


This is a useful formula for determining the minimum and maximum of a range, once a 
midpoint and the desired range spread have been selected. 


We typically use the result of a salary survey to determine the midpoint of our salary range. 
We then apply the desired range spread to determine the minimum and maximum value 


for the range. 


This process may also be useful with converting traditional grades to broadbands, where 
the range is set based on a fixed relationship to a control point in the range. 
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DEVELOPING SALARY RANGES 


Complete Process 


@ Market rate is 44,000. We decided to have 
a 40% grade range. 


= Compute the minimum of the salary range: 


pat Midpoint 
Minimun = ——_———$_$_$_$__$_$____$__——— 
4: Desired Range Sere) 


2 


Complete Process 


The minimum is equal to the midpoint divided by 1 plus one-half of the desired range 
spread. 
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DEVELOPING SALARY RANGES 


Compute the Maximum 
of the Salary Range 


Maximum = Minimum x (1 + Desired Range Spread) | 


EIEN OI 


For this example, the maximum is determined 
as follows: 


36,667 x (1+ 0.40) 
= 36,667 x 1.40 
= 51,333 


H 
H 


Compute the Maximum of the Salary Range 


If we know the minimum, there is also a useful formula for determining the maximum of a 
salary range: 


Maximum = Minimum x (1 + Desired Range Spread) 
For this example, the maximum is determined as follows: 
36,667 x (1 + 0.40) 
= 36,667 x 1.40 
51,333 


Thus the grade range is: Maximum 51,333 
Midpoint 44,000 
Minimum 36,667 


We might wish to round (to the nearest 100) the minimum (to 36,700) and the 
maximum (to 51,300), but this is purely a matter of preference. 
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Percents in Benefits Management 


= Company-sponsored/ premium 
increases 
m@ Use/ prevalence data 


@ Costing alternatives 


= Budget increases 


= Percent of total remuneration 
related to benefits 


Percents in Benefits Management 


Ratios also have applicability to benefits professionals. 
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PERCENTS IN BENEFITS MANAGEMENT 


Company-Sponsored/ 
Premium Increases 


™ Health and welfare cost to your company last 
year was 1,200,000. Your insurance company 
has raised your total cost to 1,368,000. 


m What is the total percent increase of this 
adjustment? 


1,368,000 — 1,200,000 168,000 414% 
_—_—— eee OO Oo enn f 
1,200,000 1,200,000 


Company-Sponsored/Premium Increases 


The insurance company raised your rates 14% over last year's rates. 
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PERCENTS IN BENEFITS MANAGEMENT 


Use/Prevalence Data 


@ Your EAP provider mentions that calls 
to the EAP line increased 27% over last 
month. You know last month there were 
47 calls. How many calls were received 
this month? 


47 + 12.69 = 60 calls 


_ nee EEE 


: 7 
AT x 27% = 12.69 | 
_I 


Use/Prevalence Data 


The EAP (Employee Assistance Program) line received approximately 60 calls this month. 
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Can You... 


@ Define and calculate percents? 


@ Apply percents and ratios to examine 


internal and external equity, including: 


— Individual compa-ratio? 

— Department compa-ratios? 
— Market index? 

— Percent difference? 


™@ Develop salary ranges? 


@ Identify common examples of the use 


of percents in benefits management? 
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Module 2 Quiz 


1. You spent 30% of your HR budget on benefits three years ago, 35% two years ago, 
and 40% last year. When was the most money spent on benefits? 


> 


Three years ago 
B. Two years ago 

C. Last year 
D 


. It cannot be determined based upon the information given. 


2. What is the individual compa-ratio for an employee whose salary is 25,000 per year 
when the midpoint of the salary structure is 30,000? 


A. 0.17 
B. 0.83 
C. 1.20 
D. 0.80 


3. If you earn 50,000 per year and your co-worker in the same job earns 60,000, what 
percent adjustment do you need to equal your co-worker? 


A. 18% 
B. 20% 
C. 10% 
D. 80% 


4. You have obtained a set of market data that show your senior MIS manager has a 
market average of 72,000. If you have a 50% range spread, what would the new 
minimum and maximum be if the new midpoint is 72,000? 


A. 54,000; 90,000 
B. 57,600; 86,400 
C. 60,000; 84,000 


D. They cannot be determined from the data given. 
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Module 2 Quiz (Cont'd) 


5. An employee earning 50,000 annually contributes 8% of his/her salary to a 401(k) plan 
in the first year of participation. Excluding investment earnings, how much is in this 
employee’s account at the end of the first year? 


A. 2,000 
B. 4,000 
C. 5,500 
D. 8,000 


6. At 10% investment return, what would be in the same employee’s account including 
investment earnings at the end of the year? 


A. 2,160 
B. 4,400 
C. 5,940 
D. 8,640 


7. George and Kevin agree to job share. George will work two 10-hour days and Kevin 
will work three 7-hour days. What percent of the total weekly hours is George going to 
work? 


A. 39.5% 
B. 50% 

C. 48.8% 
D. 51.2% 
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Module 2 Practice Questions 


1. You have 660 hourly, 420 salaried part-time and 320 salaried full-time employees. 
What is the percentage of salaried part-time employees? 


A. 23% 
B. 30% 
C. 45% 
D. 47% 


2. Twelve percent of your workforce of 9,500 lives in Texas. How many people in your 
workforce live in Texas? 


A. 962 
B. 1,000 
C. 1,140 
D. 1,212 


3. On January 1 of last year, you had 1,000 employees. During that year, your company 
lost money, and on December 31 there was a 20% reduction in the workforce. By July 
of this year, business again picked up, and you then increased your workforce by 20%. 
Do you now have 1,000 employees again? If not, how many do you have? 


A. Yes; 1,000 employees 

B. No; 800 employees on Dec. 31, 960 in July 
C. No; 800 employees on Dec. 31, 850 in July 
D 


. Not enough information to compute 


4. A survey shows the average pay for senior accounting clerks is 1,570 per month. 
Your midpoint for senior accounting clerks is 1,670 and the average actual pay is 
1,550. What is your compa-ratio for accounting clerks? What is your market index? 


A. .928 compa-ratio; .987 market index 
.928 compa-ratio; 1.064 market index 


.987 compa-ratio; 1.064 market index 


90 D 


.987 compa-ratio; 1.013 market index 
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Module 2 Practice Questions (Cont'd) 


5. An executive is hired into your firm at a base salary of 80,000 per year. He is 
guaranteed a 6,000 salary increase after six months of employment. What percent 
increase will he receive after six months? 


A. 6.0% 
B. 6.5% 
C. 7.0% 
D. 7.5% 


6. Suppose you were given the following data for salary grade 007: 


Grade Minimum Midpoint Maximum 


007 4,800 6,000 7,200 


What is the salary range spread of grade 007? 
A. 35% 

B. 37.5% 
C. 40% 
D. 50% 


7. If the midpoint is 2,000 and the desired range spread is 50%, what are the 
minimum and the maximum values? 


A. 1,500 minimum; 2,500 maximum 
B. 1,600 minimum; 2,400 maximum 
C. 1,700 minimum; 2,300 maximum 
D 


. 1,750 minimum; 2,250 maximum 
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Module 2 Practice Questions (Cont'd) 


8. If the minimum is 2,000 and the desired range spread is 40%, what is the midpoint? 
A. 2,000 

B. 2,300 

C. 2,400 

D. 2,800 


9. Suppose that the midpoint for grade 14 is 950, and the maximum is 1,200. Assuming 
that the structure is symmetric, what is the range spread? (rounded to 
the nearest percent) 


A. 21% 
B. 26% 
C. 52% 
D. 71% 
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Module 3 


Time Value of Money 


WorldatWork B 
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Module 3: Time Value of Money 


Introduction 


Why is the time value of money important? Because the value of money changes with 
time, whether the figures change or not. Some real world examples would be the 
balance of a college savings fund, a retirement savings fund balance, the value of a 
home, a lump sum payment from a retirement plan versus an annuity payment, or the 
growth rate of base salaries or benefits plan costs. In both remuneration and benefits, 
the time value of money is important when calculating program costs, employee 
benefits, and salary midpoint progressions. 


Objectives 


By the conclusion of this module, you will be able to accomplish the following: 
1. Define time value of money. 

2. Calculate compound interest and compound salary growth rate. 

3. Calculate constant midpoint progression. 
4 


Define annuity payments. 
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Time Value of Money 


T I M E (pb) 


INTEREST RATE 


(A%) 
PV FY 
(OLD) (NEW) | 
PV = present value FV = future value 


Time Value of Money (TVM) 


The concept of time value of money is relatively simple. As depicted in the visual above: 


Present value (PV) will increase 
to a future value (FV) with the 


inclusion of time (# PD) and interest (A%). 
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Time Value of Money (Cont'd) 
= Money in the past/money today / 
money in the future 


= Opportunity costs and lost earnings 
potential 


= Applications of TVM in business / 
HR decisions 


= Calculations easier now 


Time Value of Money (Cont'd) 


= Money in the past/money today/ money in the future — Money in hand today is 
worth more than money promised at some future time, because it can be invested with 


interest and grow over time. 


= Opportunity costs and lost earnings potential — Opportunity costs and lost earnings 
potential should influence decisions. 


— Opportunity costs refers to what is given up when a decision is made 
(the “trade-off’). 


— The lost earnings potential is the opportunity cost. 


= Applications of TVM in business/HR decisions — The application of TVM principles 
to business/HR decisions is now commonplace. 


= Calculations easier now — Present-value tables, calculators, spreadsheets and 
Internet “calculators” have made calculations easy. 


GR2 w 3.4 


© WorldatWork. All rights reserved. 


TIME VALUE OF MONEY 


Why the Value of Money 
Over Time Is of Interest 


Problem Solving in the Real World 


m™ Savings account planning 


@ Alternative purchase and 
investment decisions 


m™ Equities vs. fixed income investing 


Why the Value of Money Over Time Is of Interest 
™ Savings account planning 


— Wedding fund 

— College savings fund 

— Home purchase fund 

— Retirement savings fund 


= Alternative purchase and investment decisions 


— Home ownership 
— Personal computer 
— Automobile 
e Direct purchase (cash) 
e Bank loan 
e Lease 
e Balloon payment 


= Equities vs. fixed income investing 


— Equities (shares) vs. fixed income (bonds) 
e Capital gains 
e Dividend streams 
e Yield to maturity 
e Par value and coupon interest 
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TIME VALUE OF MONEY 


Applications to Total Rewards 
Problem Solving 


@ Retirement plan payments 
m™ Executive remuneration payments 


= Growth rate of base salary or 
benefit plan costs 


@ Alternative work-life programs 


Applications to Total Rewards Problem Solving 


= Retirement plan payments — Determine contributions and fund performance 
necessary to deliver future benefits. 


— Final average pay benefit 
— Lump sum vs. annuity 
— Actuarial equivalent 


m= Executive remuneration payments — Assess value of current vs. future remuneration 
and benefits to negotiate an employment offer. 


— Sign-on bonuses 

— Long-term cash bonuses 

— Termination payouts 

— Deferred remuneration plans 


= Growth rate of base salary or benefit plan costs — Formulas can be applied to 
determine growth rate (e.g., sales, population, participation, salaries, costs, earnings 
per share, share price appreciation). 


= Alternative work-life programs — TVM can be used to anticipate return on an 
investment in an on-site child- and elder-care centre, or to compare the cost of adding 
an additional paid holiday vs. increasing the vacation schedule. 
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Compound Interest 


= Compounding 


= Compound vs. simple interest 
— Simple interest 


— Compound interest 


Compound Interest 


= Compounding - The process of finding future values (of a payment or a series of 
payments) is called compounding. 


= Compound vs. simple interest 


— Simple interest — Interest is applied at the end of the period and only on the 
beginning balance or principal. 


— Compound interest — Interest is applied during the period, which results in a 
return not only on the principal amount but also on the interest (thus, interest on 
interest, or “compounding’”). The interest is applied at certain intervals or 
frequencies (i.e., monthly, quarterly) during the total time interval being studied. 
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COMPOUND INTEREST 


Future Value Definition /Formula 


m@ FV = Principal + (Principal x %i) 

m FV=PVv(1+i)% 

m™ This equation represents the future value of: 
— Money invested today 


— ForN periods of time 


— Atacompounding interest rate of %i 


® Variations of the equation can be used to 
determine FV, PV, %i or N. 


Future Value Definition/Formula 


™ The formula provides a means of seeing the future value (FV) of an investment (PV) 
that receives compound interest (%i) over a period of time (N). 


H Utilising this formula to calculate future value, given that you have any three of the four 
variables (FV, PV,%i, N), you can solve for the missing fourth variable. 


GR2 w 3.8 


© WorldatWork. All rights reserved. 


COMPOUND INTEREST 


Calculation of Future Value 


& 


a 


PV-1,000 | FV = 

%i= 4% | = 1,000 x (1.0816) 
N = 2years | = 1,081.60 
FV=? 


& 
i 2 
ed 


Calculation of Future Value 


PV = Present value 

%i = Interest rate per period 
N = Number of periods 

FV = Future value 


Suppose you have 1,000 that you invest in a savings account earning 4% annually over 
two years. 


Note that in year one, interest is earned only on the principal amount. In year two, interest 
is again earned on the principal amount (the original 1,000) but also on the interest earned 
in year one (40). Thus, the notion of “compounding” interest is born. In other words, 
interest is earned on interest. The amount earned in year two on the interest made in year 
one is 40.00 times 4%, or 1.60. Thus, the second year’s interest is 41.60. 


Although we can compute the amount utilising the relatively simple algebraic expression 
illustrated above, we can also use the calculator to save time making these calculations. 


We will utilise the “A%” worksheet mode to make these calculations. 
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COMPOUND INTEREST 


Percent Change/Compound Interest 
Worksheet (A%) 


We will learn to use 
the calculator to simplify our work. 


OLD Present value (PV) 
NEW Future value (FV) 


%CH Percent change 
(interest rate per period) 


#PD Number of periods (N) 


Percent Change/Compound Interest Worksheet 


Given any three of these values, we can always determine the fourth value. This is the 
essence of algebra. 
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COMPOUND INTEREST 


Calculation of Future Value 


= Compute using percent change/ 
compound interest worksheet 


— Using the A% function 


PV = 1,000 

%i = 4% per period 
N = 2 years 

FV = ? 


Calculation of Future Value 


Now let’s go back to our original example. 


PV = 1,000 

%i = 4% per year 
N = 2 years 

FV = ? 


Let’s set up the problem for the TI BAII Plus calculator. We know three of the four 
variables, so we can solve for the unknown variable “FV.” 
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COMPOUND INTEREST 


Calculation of Future Value (Cont'd) 


PROCEDURE: PRESS: DISPLAY: 


1. Select percent change/ 2nd A% OLD = (Old contents) 
compound interest worksheet 


2. Clear worksheet [ OLD = 0.00 | 


3. Enter present value (1,000) 1,000 Enter OLD = 1,000.00 
4. Move to interest display (%CH) ob | v i | %CH = 0.00 | 
5. Enter interest per period (4%) 4 Enter %CH = 4.00 
6. Move to # of periods display | #PD = 1.00 | 
7. Enter # of periods (2) 2 Enter #PD = 2.00 
8. Move to future value display | aS | | a | | NEW = 0.00 | 
(NEW) 
9. Compute future value CPT NEW = 1,081.60* 


Answer: FV = 1,081.60 


* Indicates computed number 


Note: When you input the interest into the %CH, use the whole number, not the decimal (for 4%, use 4 — 
not .04). 


Note: The “A%” worksheet in the TI BAII Plus calculator is like a spreadsheet on a rotating cylinder with four 
cells for the four variables in the PV/FV calculations. You can scroll to a particular variable utilising the 


scrolling arrows (@ or W) to get to PV, FV, %i or N. If you know any three variables, you can solve for 
the unknown variable. 
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COMPOUND INTEREST 


= Compounding quarterly 


x 4 periods/ year 


i 
i 


= 1,082.86 


FV =? 


= We have 1,000 to invest. 


8 periods total | 


= How much money will we have in two years? 


Se Sea 
_ PV = 1,000 | _ FV = 1,000 x [1+ (0.04 + 
(Mi = Me 4=1% | _ 4. 900.x (4+ 0.01)" 
N = 2years t 

| = 1,000 x (1.082857) 


(to 6-decimal-place accuracy) 


— Manual calculation of future value 


Calculation of Future Value (Cont'd) 


* senate OSS II 


Calculation of Future Value (Cont'd) 


Suppose we have 1,000 to invest. We are able to earn 4% investment income annually, 
compounded quarterly (i.e., 1% per quarter, which is the result of 4% annual income 
divided by four compounding periods per year, or 4% + 4 = 1%), and we intend to invest 
this money for two years (which results in the number of compounding periods becoming 
2 years x 4 compounding periods per year = 8 periods). How much money will we have in 


two years? 


FV = PV(1+i)N 


PV = Present value 

%i = Interest rate per period 
N = Number of periods 

FV = Future value 
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COMPOUND INTEREST 


Calculation of Future Value (Cont'd) 


= Compounding quarterly 


— Compute using percent change/ 
compound interest worksheet 


e Using the A% function 


PV = 1,000 
N = 2years~x 4 periods per year = 8 
Mi = 4%+4=1% 


Calculation of Future Value (Cont'd) 


It is important to always set up the calculator with the three known variables. Then it is a 
simple matter to find the fourth variable. 


Note: If you are calculating something other than annual compounding, the conversions need to be made on 
the “%i” and the “N” number of compounding periods. This is a simple adjustment. 
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COMPOUND INTEREST 


Calculation of Future Value (Cont'd) 


PROCEDURE: PRESS: DISPLAY: 


1. Select percent change/ 2nd A% OLD = (Old contents) 
compound interest worksheet 


2. Clear worksheet | OLD = 0.00 | 


3. Enter present value (1,000) 1,000 Enter OLD = 1,000.00 
4. Move to # of periods display | #PD = 1.00 | 
5. Enter # of periods (8) 8 Enter #PD = 8.00 
6. Move to interest display (Y%CH) [ a | %CH = 0.00 | 
7. Enter interest per period (1%) 1 Enter %CH = 1.00 
8. Move to future value display(NEW) [| * | [ NEw = 0.00 | 
9. Compute future value CPE NEW = 1,082.86* 


Answer: FV = 1,082.86 


* Indicates computed number 


Note: We scrolled up, or in a counterclockwise direction on our cylinder in the “A%” worksheet. Alternatively, 
we could have scrolled downward (W) on the cylinder and enter the four variables in the reverse 
direction. We would enter PV, then skip FV, enter i, and then enter N. 


GR2 @ 3.15 


© WorldatWork. All rights reserved. 


COMPOUND INTEREST 


Calculation of Present Value 


The equation for obtaining present 
value is solved as follows: 


eA 
G4 


PV 


Calculation of Present Value 


The future value equation consists of four terms: PV, FV, i and N. Given any three of 
these terms, we can use algebra to solve for the fourth term. 


m@ Future value process in reverse 
— Present value is the future value process in reverse. 


— It’s the inverse of compounding and involves removing or unraveling compound 
interest. 


— How do you determine the amount of money you need to invest today in order to 
realise a specific future value? 


m PV=FV/(1+i)N 


— Present value calculations reflect the economic trade-off between money received 
today versus a future date, based on a length of time involved and the available 
earnings opportunity. 


— The greater the i, the smaller the PV needs to be. 


— The greater the N (i.e., the further in the future a sum is to be received), the less 
you have to invest at the current time. 
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COMPOUND INTEREST 


Calculation of Present Value (Cont'd) 


= Compound interest 
— Manual calculation 
e In 10 years, you will need 50,000 
e 8% compounded quarterly 
e How much to set aside now? 


Calculation of Present Value (Cont'd) 


Suppose you anticipate the cost for one year of college 10 years from now to be 
approximately 50,000. You can earn 8% annually on your investment, compounded 
quarterly from a mix of high yield bonds with quarterly interest payments and dividend 
reinvestment. How much money do you need to set aside now (present value) in order to 
have the necessary 50,000 (future value) in 10 years? 
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COMPOUND INTEREST 


Calculation of Present Value (Cont'd) 
= Compound interest 


FV = 50,000 
N = 10 years x 4 periods per year = 40 
%*i = 8% + 4 = 2% interest rate per period 
PV FV 


= 22,644.52 


Calculation of Present Value (Cont'd) 


You can calculate this by using the algebraic relationship between PV and FV established 
earlier. Note that if you divide both sides of the original equation by the quantity (1+i)%, you 
can solve for PV. 
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COMPOUND INTEREST 


Compound Salary Growth Rate 
(CSGR) 


A special case of compound interest 
FV —— Ending base salary 


PV —— Starting base salary 


N —— # of increase periods 
%i ——> CSGR 


Compound Salary Growth Rate 
What is it? 


A special case of compound interest that helps determine the rate at which a salary has 
grown. 
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COMPOUND INTEREST 


Calculation of Compound 
Salary Growth Rate 
™@ Jane’s current salary is 56,900. 


@ She was hired out of college 
five years ago at 32,000. 


#@ At what annual rate has her 
salary grown? 


Calculation of Compound Salary Growth Rate 


YL Ending Base Salary ————?__ 56,900 
i i a Starting Base Salary > -32,000 
N ———? —_ # of Increase Periods —_$—_—_—_——p 5 


%| ————————_ Interest Rate per Period —M———————>_ ? 
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COMPOUND INTEREST 


Calculation of Compound 
Salary Growth Rate (Cont'd) 


PROCEDURE: 


1 


Select percent change/ 
compound interest worksheet 


Clear worksheet 

Enter present value (32,000) 
Move to next display 

Enter future value (56,900) 


Move to number of periods 
display 


Enter # of periods 
Move to interest display 


Compute interest per period 


PRESS: DISPLAY: 
2nd A% OLD = (Old contents) 
fow= 00 
32,000 Enter OLD = 32,000.00 
re] [News o00] 
56,900 Enter NEW = 56,900.00 
| ¥PD = 1.00 
5 Enter #PD = 5.00 
CPT %CH = 12.20" 


Answer: CSGR = 12.2% 


* Indicates computed number 
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COMPOUND INTEREST 


The Rule of 72 


Years to double x Growth rate = 72 


72 


Years to double = ——————_ 
Growth rate 


The Rule of 72 
What is it? 


@ The rule of 72 is a commonly used concept in finance, allowing quick calculations of 
compound interest. 


@ The rule of 72 is an approximation. As the interest rate (or growth rate) increases, the 
rule becomes less accurate. For practical purposes, it usually provides a reasonable 
approximation until the interest rate begins to approach or exceed 20%. For lower 
interest rates, it is generally very accurate. 


What are its applications? 


™@ The rule of 72 provides a good approximation of how long it will take to double 
something, given a specified constant interest rate. 


™ Itis a valuable tool for understanding investments in retirement programs, 
supplemental income planning, and other financial planning processes. 
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COMPOUND INTEREST 


r 


Using the Rule of 72 
to Determine Time to Double 
™ Peter’s salary has grown at an annual rate of 
approximately 12.47% over the past five years. 


m If his salary growth rate continues to hold, how 
long will it take for his salary to double? 


72 
Growth rate 


en a 
12.47 


~ 5.77 


Years to double ~ 


Using the Rule of 72 to Determine Time to Double 


™ Note that by using the TVM feature of your calculator, you can determine that the 
actual number years for Peter's salary to double is 5.89830503, given a compound 


salary growth rate (CSGR) of 12.47. Thus, 5.77 years is a reasonably good 
approximation to the actual value of 5.90 (rounded). 


m™ The rule can be tested by selecting a value for OLD (such as 25,000) and a value for 
NEW that is double the value for OLD (such as 50,000), and letting %CH equal 12.47. 


The resulting value will be #PD = 5.90 (rounded). 
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COMPOUND INTEREST 


Calculation of Interest Rate 


If your retirement fund grows from 10,000 
to 20,000 in six years compounded monthly, 
what annual interest rate are you earning? 
= Compute using percent change/ 
compound interest worksheet 
— Compounded monthly 


PV = 10,000 
FV = 20,000 

N = 6 years x 12 periods per year = 72 
Yi = 7 
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COMPOUND INTEREST 


Calculation of Interest Rate (Cont'd) 


PROCEDURE: PRESS: DISPLAY: 
1. Select percent change/ 2nd A% OLD = (Old contents) 
compound interest worksheet 

2. Clear worksheet | OLD = 0.00 | 
3. Enter present value (10,000) 10,000 Enter OLD = 10,000.00 
4. Move to future value display (NEW) [ v | [ NEW = 0.00 | 
5. Enter future value (20,000) 20,000 Enter NEW= = 20,000.00 
6. Move to # of periods display | v | x #PD = 1.00 | 
7. Enter # of periods 12 Enter #PD = 72.00 
8. Move to interest display (%CH) [ a | | %CH = 0.00 | 
9. Compute interest per period CPT %CH = 0.97 


10. Compute interest per year | x | [12 | | = | | %CH = 11.61" | 


Answer: Annual interest rate = 11.61% 


* Indicates computed number 


Note: When using N periods for time periods other than a year, the interest rate “Yoch” will always be expressed 


per period (i.e., monthly, quarterly or whatever). If that’s what was requested, you have your answer. 
However, if the answer is to be expressed in terms of annualized rate of interest, you have to make the 
conversion to the desired result by multiplying by the number of compounding periods per year. In the 
above example, we computed the monthly interest rate, or 0.97% per month. To annualize this, we 
multiplied the monthly rate by 12. Thus, the annualized rate is 11.61%. 
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Constant Midpoint Progression 


m Use percent change worksheet 
™ Use salary grade and survey midpoints 
m@ Calculate between lowest and highest 
midpoints 
Salary Grade Survey Midpoints 

1,119 
1,019 

918 

834 


a nN © O 


Constant Midpoint Progression 


m™ Use percent change worksheet — When constructing salary ranges, we can use the 
percent change worksheet to calculate a constant midpoint progression. 


™ Use salary grade and survey midpoints — Use the four salary grades provided along 
with survey average salary. 


= Calculate between lowest and highest midpoints — Calculate a constant midpoint 
progression between the lowest and the highest midpoints. 


#PD — Number of intervals — 3 (always the number of grades minus one) 
NEW - Highest Midpoint — 1,119 
OLD - Lowest Midpoint — 834 


This is a useful tool for helping with broadbanding. For example, it can be useful when 
transitioning from 12 grades to 3 or 4 grades. 
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CONSTANT MIDPOINT PROGRESSION 


Constant Midpoint Progression 


(Cont'd) 
Salary Grade Survey Midpoints A% 

9 1,119 

} + 9.8% 
8 1,019 

} + 11.0% 
7 918 

i") 
Ps aq } + 10.1% 


Constant Midpoint Progression (Cont'd) 
In the example above, the A% (midpoint progression) is not an equal distribution. 


In a typical salary structure, if a uniform midpoint progression is not used, lower grades 
have a lower midpoint progression (and, conversely, higher grades have a greater percent 
difference). 


In the example above, the A% appears to be very random. 


GR2 # 3.27 


© WorldatWork. All rights reserved. 


CONSTANT MIDPOINT PROGRESSION 


Calculation of 
Constant Midpoint Progression 


PROCEDURE: PRESS: DISPLAY: 
1. Change format to 1 decimal place 2nd Format DEC =(Old contents) 
1 Enter DEC = 1.0 


2. Select percent change/ DEC =(Old contents) 


compound interest worksheet 


3. Clear worksheet 2nd CirWork OLD= 0.0 
4. Enter present value (834) | oLp = 834.0 | 
5. Move to future value display (NEW) b NEW = 0.0 
6. Enter future value (1,119) NEW = 1,119.0 
7. Move to # of periods display - + #PD = 1.0 
8. Enter # of periods Enter #PD = 3.0 
9. Move to interest display (%CH) a %CH = 0.0 
10. Compute interest per period CPT %CH = 10.3* 


Answer: Constant midpoint progression = 10.3% 


* Indicates computed number 
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CONSTANT MIDPOINT PROGRESSION 


Constant Midpoint Progression 
(Cont'd) 


Applying the Constant Percentages 


Salary Grade Survey Midpoints A% 

9 1,119 

} + 10.3% 
8 1,015 

: + 10.3% 
t 920 

i) 

834 } + 10.3% 


Multiply 834 x 1.103 = 920; 920 x 1.103 = 1,015; 1,015 x 1.103 = 1,119 


Constant Midpoint Progression (Cont'd) 


When constructing salary ranges we can use the percent change worksheet to calculate a 
constant midpoint progression. Use the four salary grades provided along with survey 
average salary. 


Calculate a constant midpoint progression between the lowest and the highest midpoints. 
#PD — Number of intervals -» 3 (always the number of grades minus one) 


NEW -> Highest Midpoint — 1,119 
OLD -— Lowest Midpoint — 834 
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Annuity 


# Definition 


m@ Examples 


Mortgage payment 
— Lottery 


Retirement annuity 


Defined contribution plan 


Annuity 


= Definition — an investment that pays on a scheduled basis over a fixed amount of time 


A series of regular periodic payments comprising principal and interest that lead 
to a predetermined amount in the future 


A stream of equal cash flows delivered over some finite period of time 
(a perpetuity* that ends) 


* Perpetuity is a cash flow without a fixed end point or time period. 


A series of future cash flows (i.e., payments, receipts, deposits or withdrawals) 


m@ Examples 


Mortgage payment — 1,200 monthly payment paid over 30 years 
Lottery — win 1,000,000 jackpot and get paid in 20 annual payments 


Retirement annuity — pension payment of 800 received on the first of every 
month, beginning at age 65 and continuing for the lifetime of the retiree 


Defined contribution plan 
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ANNUITY 


Annuity Example 
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1 What is the 
present value 
oa of 100 per =. 
- year for 65 100.00 100.00 
40 years at 66 100.00 93.46 
7% annual 67 100.00 87.34 
interest? : : : 
85 100.00 25.84 
95 100.00 13.14 
105 100.00 6.68 
4,100.00 1,433.17 
| \ u + +f r |) 
FU POE > r / : | - 
TAK : 
( l4 \" _ Jt C 
( y+c\" 
: es ty 
St V ; Va 
( or ( ( me none 
es / watch the efet 
a , 
y Vv J / 
2 xporea!s y 
/ C \ 
. 7 rod = On 


4 iy & rap 


Can You... 


™@ Define time value of money? 


= Calculate compound interest and 
compound salary growth rate? 


@ Calculate constant midpoint 
progression? 


@ Define annuity payments? 
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Module 3 Quiz 


1. Which of the following should be used to determine the amount of interest earned on 
money over a specified period of time? 


A. Percent difference 
B. Percent adjustment 
C. Compound interest 
D 


. Compa-ratio 


2. How much interest will you earn on 2,500 in three years, if the interest rate is 6%, and 
the interest is compounded annually? 


A. 309.00 
'B. 477.54 
C. 489.05 
D. 977.54 


3. You start with 1,800 in your savings account. You do not make any deposits or 
withdrawals and you earn 10% interest, compounded quarterly. How much money will 
you have in your account after five years? 


A. 1,149.51 
B. 2,674.71 
C. 2,898.92 
D/ 2,949.51 


4. You need 15,000 in two years in order to take that long-awaited and much deserved 
vacation of your dreams. You can earn 12% interest, compounded quarterly from your 
credit union. How much money do you need to set aside now in order to have the funds 


for that vacation? 


A. 10,058.25 
(B. 11,841.14 
G.. V1957.91 
D. 12,802.36 
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Module 3 Quiz (Cont'd) 


5. You have been asked to comment on a proposed provision in the union contract. Over 
the next three years, management proposes increases of 2.0%, 2.5%, 3.0% over the 
three-year life of the contract. The union negotiator insists that it be 3%, 2.5%, 2.0%. 
Assuming the hourly rate for a job class is 12.50/hr., what is the rate in three years? 
A. It will be 13.46 after three years. 

B. Management rate is higher. 
C. Union rate is higher. 
D 


. Cannot calculate. 


6. Makayla’s salary has grown at a rate of 7.3% per year for the past six years. If her 
increases remain the same, how long will her salary take to double? 
A. 9.86 years 
B. 10.42 years 
C. 13.20 years 
D. 27.10 years 


7. You have the following salary grades and midpoints. Your organisation wants to have a 
constant midpoint progression. What would be the midpoint percentage? 


Midpoint 


10 2,000 P| 
11 2,217 
12 2,456 
13 2,698 
A. 8.0% 
B. 9.6% 
(C. 10.5% 
D. 11.7% 
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Module 3 Quiz (Cont'd) 


8. Which of the following describes an annuity? 


A. 
B. 
C. 


What is given up when an investment decision is made 
An investment that pays on a scheduled basis over a fixed amount of time 


Interest that is applied at the end of the period and only on the beginning balance 
or principal. 
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(blank) 
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Module 4: Statistics: Collecting, Organising, 
Grouping and Displaying Data 


Introduction 


In both remuneration and benefits, you use numbers to interpret your environment and 
establish direction. How you gather data makes a significant difference in the conclusions 
you can draw from the information. It also makes a difference in the reliability or validity of 
your conclusions. When talking about, for example, health care costs and changes to the 
levels of benefits you will provide to employees, it is critical to know that your data have 
been collected, organized, grouped and displayed accurately and effectively. 


Objectives 

By the conclusion of this module, you will be able to accomplish the following: 
1. Distinguish between populations and samples. 

2. Create frequency distributions using a quantitative process. 


m™ Organise, group and display data appropriately. 
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Population vs. Sample Data 


= Population 
= Sample 


= Value of samples 


Population vs. Sample Data 


Population 
— A population consists of all the values in the set of data to be analysed. 


— Atwork, you usually will know if you are working with a population. In other 
situations, you have to be given some key words to make such a determination, 
such as “all,” “every” or “this is a population.” 


Sample 
— Asample is a subset of a population. 


— Weare much more likely to work with sample data than with population data. 
There are two very compelling reasons to sample: 


e Time 

e Money 
Value of samples 
— Sample statistics are used to estimate population parameters. 
— Weuse sample statistics to make inferences about populations. 


— Since we can obtain many samples from a population, the key to sampling is to 
obtain a representative sample. 


— The sample must reflect the population if it is to be of value. 
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Population vs. Sample Data (Cont'd) 


Sample 3 


Population 


Sample 2 Sample k 


Population vs. Sample Data (Cont'd) 


There are many, many samples that may be obtained from a population. 
Our task is to obtain a sample that is representative of the population. 


We usually will not know how well our sample represents the population until well after 
the fact. The only way we might know this in advance is to have the population data 
available. 


If we have access to the population data, there is little reason to sample. 
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POPULATION VS. SAMPLE DATA 


Sampling Procedures 


= Random sampling 


= Non-random sampling 


— Convenience sampling 


Sampling Procedures 

™ Random sampling — Chance determines which items are selected. 

= Non-random sampling — The data collector deliberately selects the items to be 
studied. 

Convenience sampling — The data collector selects on the basis of convenience 

or availability. 
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Creating Frequency Distributions: 
Getting Started 


These data represent Data 
ages of 50 employees 22 46 40 38 
in a small business. Be. Sh) AE oT 
45 21 56 43 
36 25 33 25 
53 55 28 28 
47 64 35 38 
57 22 48 35 
35 56 29 31 
27 23 39 41 
25 27 40 58 
18 60 42 46 
32 62 52 

35 42 59 


Creating Frequency Distributions: Getting Started 


What is the variable of interest? 


— Age. (These are the ages of all employees in a small business. Such demographic 
information can be collected to tailor total rewards programs for their employee 
population.) 


What is the level of measurement for your variable of interest? 


— Ratio. [We can say whether the employees are the same or different ages (at least 
nominal); we can place them in rank order by their ages (at least ordinal); the 
difference between 35 and 45 is the same as the difference between 47 and 57 
(at least interval); and there is a true zero for age (ratio).] 


Do these data represent a sample or a population? 


— Population. (We are only interested in these 50 employees. It would have been a 
sample if this were just a subset of a larger data base.) 


Are these data suitable for grouping into intervals (i.e., categories, groups, etc.)? 


— Yes. (In its current format, the data represent chaos. We want to create order out of 
chaos.) 


Are there any other variables available that can be used to better understand the 
variable of interest? 


— Notat the present time. (We only know their ages. We do not know their job types, 
their performance scores, or any other variables about them at present.) 


GR2 mg 4.6 


© WorldatWork. All rights reserved. 


CREATING FREQUENCY DISTRIBUTIONS 


Organise and Group the Data 


These data have been 
reorganised from 
oldest to youngest. 


Organise and Group the Data 
m™ First, we order the raw data from highest to lowest. 


™ Second, we look for opportunities to combine data into groupings (this will help show 
how the data are clustered). 
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CREATING FREQUENCY DISTRIBUTIONS 


-———— 


Grouping Data 
= Group the data 


— Intervals 
¢ Number of intervals 
e Interval widths 
¢ Create intervals 
¢ Distribute data 


¢ Create a picture 


Grouping Data 


= Group the data 


Intervals — Group the data into intervals or categories in order to better visualise 
the distribution and look for patterns or trends. 


Number of intervals — Determine the number of intervals desired. 
Interval widths — Determine the appropriate interval widths. 
Create intervals — Create the intervals. 

Distribute data — Distribute the raw data into the intervals. 


Create a picture 
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CREATING FREQUENCY DISTRIBUTIONS 


Grouping Data into Intervals 


= 5 to 20 intervals 
= Constant width 


= No overlaps 


=» Accommodate all data 


Consider odd widths 


Grouping Data into Intervals 


= 5 to 20 intervals — It is not often that one would use fewer than 5 or more than 20 
intervals (some sources suggest 5 to 15 intervals). The number of intervals is an 
arbitrary decision. 


= Constant width — Make sure each interval is the same width (i.e., a constant width for 
each interval). 


™ No overlaps — Make sure each data point can go into one and only one interval 
(i.e., each interval is mutually exclusive, with no overlaps). 


= Accommodate all data — Make sure the set of intervals will accommodate all the data 
(i.e., collectively exhaustive). 


= Consider odd widths — Consider using odd widths for intervals when appropriate, in 
order to ensure that the midpoints of the intervals will be integers. 
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CREATING FREQUENCY DISTRIBUTIONS 


Determining the 
Widths of Intervals 


= Find the range 


= Divide the range by 5 and by 20 


= Make a guided arbitrary decision 


Determining the Widths of Intervals 


m Find the range — Find the difference between the highest and lowest values 
(i.e., find the range). 


64-18 =46 

™ Divide the range by 5 and by 20 
46 + 20=2.3 
46+5=9.2 


For this distribution of age data, interval widths between approximately 3 and 9 will 
yield between 5 and 20 intervals. 


m™ Make guided arbitrary decision — We will make the guided arbitrary decision of 
using interval widths of 5. 


GR2 w 4.10 


© WorldatWork. All rights reserved. 


CREATING FREQUENCY DISTRIBUTIONS 


Frequency Distribution 


Based on Intervals of Grouped Data 
Create the Frequency Table 


Frequency Distribution Based on Intervals of Grouped Data 


= Absolute frequency — Enter the number of data points that are within the interval. 
There are four data points that are in the interval 20-24. 


™ Relative frequency — Calculate the relative frequency by dividing the absolute 
frequency by the total number. For the interval 20-24, the relative frequency is 
4 divided by 50 or 8%. 
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CREATING FREQUENCY DISTRIBUTIONS 


Frequency Distribution 


Based on Intervals of Grouped Data 
Completed Frequency Table 


= 
=| R/O; RIO!|N|) ON] O/ & 


Frequency Distribution Based on Intervals of Grouped Data 
The graphic shows the completed frequency distribution table. 


m@ Why are we calculating the relative frequencies? 
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CREATING FREQUENCY DISTRIBUTIONS 


Create a Picture: 
Bar Graph for Grouped Age Data 


Absolute Frequencies 


15-19 20-24 25-29 30-34 35-39 40-44 45-49 50-54 55-59 60-64 


Interval Ranges for Age Groups 


Create a Picture: Bar Graph for Grouped Age Data 


Use a frequency distribution table to create a picture and arrange the data for analysis. 
A bar graph is a good example of the type of picture that can be created (as shown above). 


This process began with random data and now it is beginning to tell us something. When 
data are organised, they start to become information. 


In this example, the graph shows “absolute frequencies” instead of “relative frequency 
percents” because we are not comparing this distribution of ages against any other data 
set. If we were to compare this distribution of ages against some other data set, such as 
across departments, across geographic locations, or across industries, we would most 
likely want to convert these absolute frequencies to relative frequency percents, since it is 
unlikely that a comparison data set would have the exact same number of people as this 
data set. 


A bar graph can help visualise 


a frequency distribution. 
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Case Study 


Madrid 
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CASE STUDY 


Testing the Model 
with a Case Study: 
Madrid Data 


Territory Madrid 


ailelehVas-3) 20 field representatives of a large 
organization sampled from this 
| particular geographic region 


Variable : 
| of interest — 


| Bonus points 


Testing the Model with a Case Study: Madrid Data 


What is the variable of interest? 
— Bonus points. 
What is the level of measurement for your variable of interest? 


— Ratio. (This depends on how bonus points are administered. If we can assume 
field representatives receive one bonus point for each new customer acquired, 
they we have satisfied ratio level scale. This is why identifying and understanding 
our metrics is important.) 


Do these data represent a sample or a population? 

— Sample. (Given in the story problem.) 

Are these data suitable for grouping into intervals (i.e., categories, groups, etc.)? 
— Yes. (We will review the data to make this determination.) 


Are there any other variables available that can be used to better understand the 
variable of interest? 


— Not at the present time. 
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CASE STUDY 


Madrid Data 
in Raw, Unordered Form 


Points 


1 11 28 
2 12 9 
3 13 45 
4 14 18 
5 15 59 
6 16 20 
7 17 15 
8 18 63 
9 19 24 
10 49 20 39 


Madrid Data in Raw, Unordered Form 


™ Data often come in without any structure or order. In this raw, unordered form, it is 
difficult to make any conclusions about the data. 


™ The first step is to organize the data by listing the bonus points from the highest to the 
lowest, called “rank order.” 
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CASE STUDY 


Madrid Data 
in Raw, Ordered Form 


Bonus 
Points 


Salesperson 


Madrid Data in Raw, Ordered Form 
m When graphing data, what would be on the x-axis? 


m When graphing data, what would be on the y-axis? 
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Graph of Madrid Data 
in Interval Widths of 1 


Madrid: Raw Data Format 


> 


oo 


Frequencies 
LS) 


Steal 


; EEGs 
"G 3 6 sa a Pir Sara arr Sp apes ar 


Bonus Points 


Graph of Madrid Data in Interval Widths of 1 


Graph of bonus points for Madrid without grouping the data 
The variable of interest (bonus points) is located on the x-axis. 


There is no other variable. Instead, we locate the frequencies of bonus points on the 
y-axis. 


The graph is very flat. 
No apparent trend or pattern is easily seen. 


We might want to group the data in an effort to identify patterns or trends. 


GR2 w 4.18 


© WorldatWork. All rights reserved. 


CASE STUDY 


Determining the Widths 
of Intervals 


= Find the range 
= Divide the range by 5 and by 20 


= Make a decision 


Determining the Widths of Intervals 

m Find the range — Find the difference between the highest and lowest values. 
63-9 =54 

m™ Divide the range by 5 and by 20 
54+20=2.7 
54+5=10.8 


— For the Madrid data, interval widths between approximately 3 and 11 will yield 
between 5 and 20 intervals. 


— Odd widths yield whole numbers for midpoints. 
— Even widths yield decimals for midpoints. 


m Make a decision — We need to make a guided arbitrary decision. 
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CASE STUDY 


Frequency Distribution for Grouped 
Madrid D rval Widths of 5 


LL — Lower limit of interval 


60 1 
55 2 UL — Upper limit of interval 
50 0 MP — Midpoint of interval 
45 3 
40 0 F — Absolute frequency 
35 3 F% — Relative frequency 
30 1 
25 2 
20 4 
15 3 
10 ) 

5 1 Bottom interval 

0 0 begins at the origin 


TOTAL | 20 | 100% 


Frequency Distribution for Grouped Madrid Data: Interval Widths of 5 


There are now 12 intervals (meeting the recommendation of having between 5 and 20 
intervals). 


The midpoints of each of the intervals are integers (whole numbers): 2, 7, 12, 17, 22, 
27, 32, 37, 42, 47, 52, 57, 62. 


The bottom interval does not include any data, but it does keep us at the “origin.” 
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CASE STUDY 


Madrid Data: 
Interval Widths of 5 


Absolute Frequencies 


Bonus Points (i.e., Prize Points) 


Madrid Data: Interval Widths of 5 


A computer-generated bar graph is more sophisticated but does not yield any more 
information. 
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CASE STUDY 


Frequency Distribution for Grouped 
Madrid Data: Interval Widths of 10 


F% 
z= 
| 10% 
15% 
20% 
25% F%-Relative frequency 


LL — Lower limit of interval 


UL —- Upper limit of interval 


MP — Midpoint of interval 


F — Absolute frequency 


fe) 
oi Bottom interval does not 


5% begin at the origin 
| 100% 


Frequency Distribution for Grouped Madrid Data: Interval Widths of 10 


Notice that the bottom interval goes from 1 to 10. What would you do if you have a new 
employee who has no bonus points? You should always begin at the origin (0) whenever 
possible and reasonable. We will illustrate this in a later graph. 
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CASE STUDY 


Madrid Data: Interval Widths of 10 
Starting at 1 Instead of 0 


Absolute Frequencies 


Bonus Points 
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CASE STUDY 


Frequency Distribution for Grouped 
Madrid Data: Interval Widths of 10 
Starting at 0 Instead of 1 


wm 
it 


=]=(|Wi/O;};P/)/O!N/ —1) 


Frequency Distribution for Grouped Madrid Data: Interval Widths of 10 
Starting at 0 Instead of 1 


This time the bottom interval goes from 0 to 9. We can now accommodate new employees 
who do not have any bonus points yet. You should always begin at the origin (0) whenever 


possible and reasonable. 


GR2 @ 4.24 


© WorldatWork. All rights reserved. 


CASE STUDY 


Madrid Data: Interval Widths of 10 
Starting at 0 


Absolute Frequencies 


4.5 14.5 24.5 34.5 44.5 54.5 64.5 
Bonus Points 


Madrid Data: Interval Widths of 10 
Starting at 0 
™ Here is the graph when going from 0 to 9. 


m™ The shape is slightly different from the previous bar graph. This change is a direct 
function of the number of data points and the number of intervals selected, as well as 
the shape of the distribution. 

™ Notice that this graph and the previous bar graph for Madrid are based on the absolute 
frequencies. 


m= We might want to use “relative frequency percents” if we later decide to compare the 
distribution for Madrid to the distribution for some other geographic location, since the 
two sets of data are very likely to consist of different numbers of field representatives. 
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Can You... 


= Distinguish between populations 
and samples? 


= Create frequency distributions 
using a quantitative process? 


— Organise, group and display data 
appropriately? 
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Module 4 Quiz 


You have been given the task of examining the effect of a pay-for-performance project 


on employee productivity. You have obtained a “productivity rating score” for all 
program participants. Which of the following best describes your data? 


A. 


The data represent a sample. 


B. The data represent a population. 
C. 
D 


. Productivity rating scores always produce ratio data. 


Productivity rating scores only produce nominal data. 


Why do we sample? 
A. 
B. 
C. 
D. 


Because samples are more accurate than populations 
Because of the lack of time and money 
Because computers only work with sample data 


Because population data are unreliable 


Which of the following provide useful guidelines for grouping data into categories? 


A. 


Use a constant width for all categories. 


B. Use mutually exclusive categories. 
C. 
D. All of the above. 


Do not allow categories to overlap. 


GR2 @ 4.27 


© WorldatWork. All rights reserved. 


(blank) 
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Module 5 


Statistics: 
Lying with Statistics, 
Graphs and Displays 


WorldatWork B 
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Module 5: Statistics: Lying with Statistics, 
Graphs and Displays 


Introduction 


Ideally, statistics can be used to effectively organise and summarise data. When carefully 
prepared and interpreted, statistical information can help weigh options and support 
organisational decision making. The goal of clear, accurate quantitative information can 
be hampered, however, if misleading features in the data are not identified and avoided. 
In this module, we will review common errors that can lead to distorted data. 


Objectives 


By the conclusion of this module, you will be able to accomplish the following: 
1. Recognise data that may have been distorted. 


2. Know how to avoid making mistakes that distort data. 
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“Lying” with Statistics 


@ Objectivity 
@ Distorting data 


™ Know when you are being misled, 
and avoid misleading others 


“Lying” with Statistics 


™ Objectivity — We should always develop a plan that provides an objective approach to 
collecting data, organising data and displaying data. All too often, though, we lose track 
of the plan due to lack of objectivity. 


—™ Distorting data — With the advent of the computer, statistical software, and the ready 
availability of graphic tools, the temptation to distort the data (whether conscious or 
subconscious) is great. 


— We should be true to our analysis and follow our plan of objectively analysing and 
presenting our data. 


= Know when you are being misled, and avoid misleading others — Our purpose in 
this module is not to teach you how to lie with statistics. 


— Instead, we hope to teach you to know when you are being lied to with statistics 
and how to avoid lying to others with statistics. 
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“LYING” WITH STATISTICS 


Example 1 


Mortality Graph 


Number of Survivors 
ui 
So 
So 


65 66 67 70 75 80 85 90 95 100 105 
Age 


Example 1: Mortality Graph 


Could this graph be misleading? Mortality Table 


™@ Note that this graph only shows certain 
«. Numb f Survivo 
ages. Without the use of the “railroad Probability = iil eMedia aid 
1,000 Retirees 


tracks” given on the X-axis, we might 
have concluded there is a more dramatic 
trend occurring after age 67, relative to 
the trend from age 65 to age 67. While 
this, in reality, may or may not be the 
case, the graph does not appropriately 
reflect the data. 


m@ Inthis example, it is not unreasonable 
that the “age” variable did not begin at 0, 
since the only relevant data for this graph 
would be all normal retirees (who have 
reached 65 or greater). If the graph 
began at 0, the graph would be so wide 
to display all the data that the pertinent 
information would become very small on 
the visual representation. 
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“LYING” WITH STATISTICS 


Example 2 
Salary Budget Increase Trends 
Showing Dramatic Changes Over Time 


f& Hourly O Salaried 


10.0 i 
33: 

i 
80 4 
10 ef 
40 off) of ofl, 


Year of Data 


Percent Increase over 
Previous Salary Budget 


Example 2: 
Salary Budget Increase Trends Showing Dramatic Changes Over Time 


™ Does this graph suggest anything? 
— Look at the pattern of budget increases for salaried vs. hourly. 
— Look at the trend. 


— Does the trend appear to be dramatic? 
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“LYING” WITH STATISTICS 


Example 2 
Salary Budget Increase Trends 
Using Appropriate Graphing 


& Hourly O Salaried 


7919801981929 I9RK URS 1989 1990 199119921998 1994 199519951997 199819992000 


198619871988 
Year of Data 


Percent Increase over 
Previous Salary Budget 


Example 2: 
Salary Budget Increase Trends Using Appropriate Graphing 


™ Does this graph look different from the previous graph? 
— Again, look at the trend. 
— Does the trend appear to be less dramatic than in the previous graph? 


m Why are they different? 
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“LYING” WITH STATISTICS 


Example 2 
Salary Budget Increase Trends — 
How the “Lie” Was Committed 


Percent Increase over 
Previous Salary Budget 


3 1984198 9 1990 199119921993 198419951996. 199719981999 2000 


198619871988 
Year of Data 


Example 2: 
Salary Budget Increase Trends — How the “Lie” Was Committed 


™ Note the scale of the graph on the Y-axis in the first graph and then in the second 
graph. 


m™ The first graph hides the bottom part of the graph. 
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“LYING” WITH STATISTICS 


Example 3 
Was It Twice As Hot 
Yesterday As Today? 


Temperature for Two Dates 


Temperature (Fahrenheit) 


Yesterday Today 


Example 3: Was It Twice As Hot Yesterday As Today? 


Note that the intervals have constant widths. That is, the difference from 70-80 degrees is 
the same as the difference between 60-70 degrees, etc. 


Is there a true zero for temperature (for Fahrenheit or Celsius)? 
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“LYING” WITH STATISTICS 


Example 3 
Is There a True Zero? 


Temperature for Two Dates 


Temperature (Fahrenheit) 


Yesterday Today 


Example 3: Is There a True Zero? 


Since temperature data, measured in Fahrenheit or Celsius (not Kelvin), do not have a true 
zero point, graphing such data can often be misleading. 
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“LYING” WITH STATISTICS 


Example 4 
Distribution of Types of Jobs 


— 


Distribution of Employees by Functional Area 


Percentage in Category 


Accounting — Engineering Production Clerical Marketing 


Types of Jobs 


Example 4: Distribution of Types of Jobs 

m@ What is the variable of interest? 
— Job title 

m What is the level of measurement for the variable of interest? 
— Nominal 

m@ Then why are the five categories listed in this particular order? 


— Could it be that someone wants to suggest something about a normal 
distribution? 
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“LYING” WITH STATISTICS 


Example 4: Distribution of Types of Jobs 


Distribution of Employees by Functional Area Distribution of Employees by Functional Area 


‘Accounting Engineering Production Clerical Marketing Accounting Marketing Clerical‘ Engineering Production 
Types of Jobs Types of Jobs 


Is it normal? Is it skewed with a tail to the left? 


Distribution of Employees by Functional Area 


Production Clerical Engineering Accounting Marketing 
‘Types of Jobs ‘Accounting Clerical Engineering Marketing Production 


Is it skewed with a tail to the right? Should it be alphabetically ordered? 


Example 4: Distribution of Types of Jobs (Cont'd) 
™ Here is the same information, presented four different ways. 
™ One could (incorrectly) argue it is: 

— Normal 

— Skewed with a tail to the left 

— Skewed with a tail to the right 


m There is no right answer about a distribution shape for nominal data. It might be more 
honest to simply list the categories in alphabetical order. 
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“LYING” WITH STATISTICS 


Example 5 
Displaying Percents 
with Stacked Bar Charts 

% 

Benefits 30 
Labour 

Relations 20 
Personnel 

Services 20 

Remuneration 15 

Training & 15 
Development 

This Year 


Next Year 


% 


35 


15 
15 


20 


15 


Example 5: Displaying Percents with Stacked Bar Charts 


™ Does the benefits department have a larger budget next year? 


m What about the remuneration department? 
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Can You... 
m™ Recognise data that may 
have been distorted? 


@ Avoid making mistakes 
that distort data? 
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Module 5 Quiz 


1. How can HR professionals ensure their statistical data are not distorted? 


A. 
B. 


Use statistical software, which prevents distortion of data. 


Develop a plan that provides an objective approach to collecting, organising and 
displaying data. 

Avoid training that shows how data can be distorted, as that training might cause 
one to distort data subconsciously. 


Distortion of data is commonly accepted as a method for making compelling 
arguments for change in organisations. 


Which of the following is true of distribution shapes for nominal data? 


A. 


They generally are normal distributions. 


B. They are best represented skewed with a tail to the left. 
C. 
D. It might be most honest to simply list the categories in alphabetical order. 


Nominal data should be represented via pie charts only. 
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Module 6 


Statistics: 
Measures of Central Tendency 
and/or Location 


WorldatWork B 
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Module 6: Statistics: Measures of Central Tendency 
and/or Location 


Introduction 


Measures of central tendency are measures of the location of the center of a distribution, 
or, in other words, the most typical, representative value. Measures of location provide 
information about other points in a data set. Together, these tools help make sense out of 
raw data so the HR professional can interpret and present the results of his or her research. 
In this module, we will review measures of central tendency and location, and discuss 


percentile bars. 


Objectives 

By the conclusion of this module, you will be able to accomplish the following: 

1. Identify and define measures of central tendency, including mode, median and mean. 
2. Identify and define measures of location, including percentiles. 


3. Discuss percentile bars. 
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Measures of Central Tendency 
@ Provide information about the 
center of a distribution 
m™ Mode, median, mean 
@ Three different approaches 


# No information about variance 


Measures of Central Tendency 


= Provide information about the center of a distribution — Measures of central 
tendency are specific measures of location that provide information about the center of 
a distribution or about the most typical, representative value. 


™ Mode, median, mean — Common measures of central tendency include the mode, 
median and mean. 


m= Three different approaches — These measures are based upon three different 
approaches that help identify typical values for distribution. 


® No information about variance — Measures of central tendency do not give any 
information about how varied or dispersed data are. 
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MEASURES OF CENTRAL TENDENCY: MODE 


Mode 


Mode > 24 


1 
1 
1 
3 
3 
30 2 
6 
3 
1 
1 
1 
1 


Mode 

m™ The mode is the value that occurs most often among a data set. 
Example: 

Organizing data for analysis 

m™ Place the data into rank order. 


m= Group the data into intervals or common values. 


What if there were six individuals with 36 months? 
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MEASURES OF CENTRAL TENDENCY: MODE 


Mode 


What if there are 
six individuals 
with 36 months? 


1 
1 
1 
3 
6 
2 
Mode > 24 6 
3 
1 
1 
1 
1 


Mode 


Suppose we found there were two more employees who had 36 months’ experience in 
benefits administration. What value(s) would now best describe the mode? 


@ 24 and 36 


= The distribution is now bimodal. 
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MEASURES OF CENTRAL TENDENCY: MEDIAN 


Median 


Example: Odd number of data points 


37 39 
38 38 
36 37 37. © Median 
40 37 
39 36 
37 32 


Median 


The median corresponds to a point along an ordered distribution such that the same 
number of observations fall above and below this point. 


m Find the median number of bonus points earned for the following set of data: 
32, 37, 38, 36, 40, 39 and 37. 


Place the data into rank order. 


Determine the median. 


The median discounts extremes and measures only 


the middle value, which is most representative 
of what is happening in the middle of the data set. 
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MEASURES OF CENTRAL TENDENCY: MEDIAN 


Median 


Example: Even number of data points 


Tecan = 11 


Median 


For an even number of data points, the median is the average of the two middle-most 
observations. 


m™ Find the median hourly pay for the following set of data: 13, 10, 14, 7, 9, 7, 12 and 13. 
m™ Place the data into rank order. 


m@ Determine the median. 
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MEASURES OF CENTRAL TENDENCY: MEAN 


Mean 
@ Arithmetic average 


™ x represents the mean 


~ Mean — Sum of all numbers 


Number count 


Mean 


= Arithmetic average — The mean is the arithmetic average. 


Mean = Sum of allnumbers 
Number count 


™ xXrepresents the mean — We use x (called “x bar’) to represent the mean. 
Sum (x) 
N 


x= 
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MEASURES OF CENTRAL TENDENCY: MEAN 


Example 
Data X - 104+14+12+16+12 
etd: | ? 
a 
8] ° 
fae = 12.8 years of experience 


Example 


Find the mean number of years of experience for five employees, based upon the 
data provided. 
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MEASURES OF CENTRAL TENDENCY — USING THE CALCULATOR 


Using the Calculator: Entering Data 


before we begin. 


e 
a 


O 


Ee] ee) Let’s arbitrarily 

om ®]) set the number 

ca Po of decimal 

io ct | places to 1 
CI} 


? 
> 
3 
Be 


COO ils CF 


" 
0 
'B 


i 


Using the Calculator: Entering Data 


m™ The “Stat” menu includes a number of statistical models for analysing data. When we 
are only working with one variable (i.e., one variable of interest and no other variable, 
such as a predictor variable), we want to be in the one-variable model. The calculator 
uses 1-V to symbolise this model. This is the default menu, for which the calculator was 


pre-set. 
m™ There are five possible models, in the order they will appear within the calculator menu: 
—  1-V (one variable model) 
— LIN (linear model) 
— Ln (natural log.model) 
— EXP (exponential model) 
— PWR (power model) 


m™ Tochange from one model to another, you must be in the “Stat” menu. 


Note: If you are in the "Stat" menu and press 2nd function, Clear Worksheet, the calculator will automatically 
reset itself to the LIN mode. In many cases, this might not be the mode you want for the problem at 
hand. If not, then you will have to press 2nd function, Set, then 2nd function, Set, etc., until you move to 


the desired mode. 
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MEASURES OF CENTRAL TENDENCY — USING THE CALCULATOR 


Calculation of the Mean — 
Compute Using “Statistics Worksheet” 


PROCEDURE: PRESS: DISPLAY: 


1. Select worksheet. 2nd Data X01 = (Old contents) 


2. Clear worksheet. 2nd X01 = 0.0 


3. Enter data. 


a)Enter 1st data point. 


Move down 2 displays. 


b)Enter 2" data point. 


Move down 2 displays. 


c) Enter 3% data point. 


Move down 2 displays. 


d)Enter 4" data point. 


Move down 2 displays. 


m 
pe | 
= 
@ 
+ 
x 
jo) 
ow 
I 
— 
oa 
oO 


e) Enter 5' data point. 
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MEASURES OF CENTRAL TENDENCY — USING THE CALCULATOR 


Calculation of the Mean — 
Compute Using “Statistics Worksheet” (Cont'd) 


Checking your data 


m After entering the last value (12 in this example) and pressing “Enter,” you have 
completed the data entry process. 


Now press the “Down Arrow” key four more times. 
This will take you to the top of your data list. 


You can press the “down arrow” repeatedly from here to scroll through your entire data 
base. 


m™ You can correct data entry errors by simply replacing incorrect values with the correct 


values. 
— For example, if you accidentally input 22 instead of 12 for the third X-value, scroll 
to X03 = 22 


— Enter 12, and press “Enter.” 
— You have replaced the incorrect value (22) with 12. 
— You have now corrected your data. 


m™ These values will remain in your calculator until you clear your worksheet (by pressing 
2nd function key, followed by pressing the CLR Work key). 


m If you accidentally input an extra value in your list, you can delete it. 


— For example, if you did not intend to have the 12 in your list at all and now want 
to remove that number without having to completely re-enter the entire set of 
data, you can simply scroll to the value you want to eliminate and then press 2nd 


function, DEL (for Delete). 


m™ Remember, you cannot simply replace the value you want to eliminate from the list by 
inputting a O and pressing Enter. Zero is a real number, and it will be included as 
another value in your list. 


m Inthe same manner, if you want to insert a value somewhere in the middle of the list, 
you can scroll to where you want to input the new value, then press 2nd function, INS 


(for Insert). 
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MEASURES OF CENTRAL TENDENCY — USING THE CALCULATOR 


Using the Calculator: 
Select the correct model in the “Stat” menu 


|2nd| < Stat > [LIN (or another model) | 
2nd Set Ln 


Set | EXP 
| 2nd Set | 1-V 


Set PWR 
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MEASURES OF CENTRAL TENDENCY — USING THE CALCULATOR 


Using the Calculator 
In 1-V, arrow down to find n and x 


= 5.0 


n 
v [x= 12.80 | 
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MEASURES OF CENTRAL TENDENCY: MEAN 


Weighted vs. Unweighted Mean 


Example: Teller Salaries at Three Banks 


Mean Annual 
Salary of 


Incumbents 


20,000 
18,400 
56,400 


Weighted vs. Unweighted Mean 


The weighted mean gives more weight to the companies who have more people. The 
unweighted mean treats all three banks equally and disregards the number of people. 


= Weighted mean: obtained by weighting each value by the number of times that value 


has occurred in the set of data, and then averaging. 


= Unweighted mean: obtained without weighting each value by the number of times that 
value has occurred in the set of data. In some surveys, raw data are not disclosed. 


— Instead, data are presented in summary form. 
Example: Incumbents and average salaries given by company 


m™ The distinction between weighted and unweighted means then becomes important. 
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MEASURES OF CENTRAL TENDENCY: MEAN 


Weighted vs. Unweighted Mean 


Example: Teller Salaries at Three Banks 
Additional Information: Number of Incumbents 


_ Mean Annual 
Numberof | Salaryof 


Incumbents Incumbents _ 
18,000 


20,000 


18,400 


Find the weighted mean salary of the 45 incumbents. 
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MEASURES OF CENTRAL TENDENCY: MEAN 


Calculation of the Mean — 
Compute Using “Statistics Worksheet” 


Example: Find the average salary for tellers at the three banks. 


PROCEDURE: PRESS: DISPLAY: 
1. Select worksheet. 2nd Data X01 = (Old contents) 
2. Clear worksheet. 2nd X01 = 0.00 
3. Enter data. 
a) Enter 1st data point. 18000 Enter X01 = 18,000.00 
Move down 2 displays. we v X02 = 0.00 
b) Enter 2nd data point. 20000 [ x02 = 20,000.00 | 
Move down 2 displays. + v | x03 = 0.00 | 
c) Enter 3rd data point. 18400 Enter X03 = 18,400.00 
Move down 2 displays. + v X04 = 0.00 
4. Select statistical calculation 
portion of statistics worksheet. (Old contents) | 


5. Clear worksheet. 2nd Clr Work LIN 


6. Select one-variable 


calculation method. [ 1-v 


7. View the number of cases. + = 


8. View the mean. Lv | X= 


Note: LIN = Standard linear regression, 1-V = One-variable statistics 
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MEASURES OF CENTRAL TENDENCY: MEAN 


Manual Calculation 


MeanAnnual —_ Total 


Number of _ Salary of — Incumbent | 

Bank C Incumbents _ _ Salaries 
A 15 x 18,000 = 270,000 
B 20 x 20,000 = 400,000 
Cc 10 x 18,400 = 184,000 
45 56,400 854,000 

Weighted = Sse = 18,977.78 

45 
Unweighted = a = 18,800 


Manual Calculation 


= Weighted mean — To calculate the weighted mean (find the mean salary of the 
incumbents), multiply the number of incumbents by the means in each company, and 
divide the total number of incumbents into the total of all the incumbents’ salary means. 


= Unweighted mean — To calculate the unweighted mean (find the mean salary in the 
companies), divide the total number of companies into the total mean annual salary of 
incumbents. 
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MEASURES OF CENTRAL TENDENCY: MEAN 
Calculation of the Weighted Mean — 


Compute Using “Statistics Worksheet” 


Example: Find the weighted average salary for the 45 tellers at the three banks. 


PROCEDURE: 


1. 


Select worksheet. 
Clear worksheet. 


Enter data. 


a) Enter 1st data value. 


Move down 1 display. 


b) Enter frequency for first value. 


Move down 1 display. 


c) Enter 2nd data value. 


Move down 1 display. 


d) Enter frequency for 2nd value. 


Move down 1 display. 


e) Enter 3rd data value. 


Move down 1 display. 
f) Enter frequency for 3rd value. 


Select statistical calculation 
portion of statistics worksheet. 


Clear worksheet. 

Select one-variable 
calculation method. 

View the number of cases. 


View the mean. 


PRESS: DISPLAY: 

2nd Data X01 = (Old contents) 

2nd | xo1 = 0.00 | 
18000 Enter X01 = 18,000.00 
v Y01 = 1.00 

15 Enter | Yo1 = 15.00 | 

X02 = 0.00 | 
20000 Enter ee 20,000.00 
v Y02 = 1.00 

20 [ Enter] | Yo2= 20.00 | 

lv | X03 = 0.00 | 
18400 Enter X03 = 18,400.00 
v Y03 = 1.00 

Y03 = 10.00 | 


2nd Stat 
2nd Set 


(Press both 4 times) 


v 
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(Old contents) 


ail 


1 
[n = 45.00 
X= 18,977.78 


MEASURES OF CENTRAL TENDENCY: MEAN 


Using Means 


m@ Define your audience 

= Know definition 

= At least interval level data 
m@ Influenced by outliers 


m@ Most common 


Using Means 
™ Define your audience — When you report a mean, define it to your audience. 

— This is especially important when you are using a weighted or unweighted mean. 
= Know definition — When you read a mean, know its definition. 

— Ask questions if you are not sure what kind of mean you are reading. 


= At least interval level data — As a general rule, you should have at least interval level 
data before using the mean. 


m Influenced by outliers — Means are influenced by outliers, while medians and modes 
are not. 


= Most common - The mean is the most commonly reported measure of central 
tendency. 
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MEASURES OF CENTRAL TENDENCY 


Comparing Various Measures 
of Central Tendency 


Annual Pay 
60,000 
65,000 
70,000 

75,000 


Employee 
Wendy 
Theo 
Janine 


Median 
Mean 


70,000 


™ Notice the mean and the median are equal. 
@ The distribution is completely flat. 
= Therefore, there is no mode. 


Comparing Various Measures of Central Tendency 
m™ Inthis example, the mean and median salaries are the same. 


m™ The mode salary is not applicable (N/A), since no value occurs more than once. 
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MEASURES OF CENTRAL TENDENCY 


Comparing Various Measures 
of Central Tendency (Cont'd) 


Steve was replaced by Bill, Employee Annual Pay 


the CEO. Wendy 60,000 
How does his salary impact: Theo 65,000 
— the mode? Janine 70,000 


=” ian? 
the median? Mary 75,000 


930,000 


— the mean? 


Central 
Tendency 


Mode 


Median 


m@ The mode remains the same. 
m@ The median remains the same. 
@ The mean salary has now tripled. 


@ Only the mean is influenced by 
outliers. Mean 


N/A 
70,000 
240,000 


Comparing Various Measures of Central Tendency (Cont'd) 


m In this example, the mean has dramatically increased when the salary of 80,000 is 
replaced with a salary of 930,000. The mean salary has tripled from 70,000 to 240,000, 
but the median has not changed. 


Sometimes, the median might better reflect the central tendency of the data. 


Even though we have at least interval level data (actually ratio), the mean might not be 
the best measure. 


m We have several options: 
— Use the median. 


— Trim the mean by eliminating the same number of values from the top and bottom 
of the list. 


— Eliminate the outlier. 


m™ Ofcourse, we must have a very sound reason to justify eliminating the outlier. In this 
case, the CEO’s pay would not be representative of HR employees. If we are 
describing the typical HR employee’s pay, we should drop the outlier. 
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Measures of Location 


™ Based on same concept as median 
™ Quartiles 
@ Percentiles 


# No information about variance 


Measures of Location 


Based on same concept as median — Measures of location are based on the same 
concept used to find the median. 


— The median provides information about the central tendency of the distribution. 

— Other measures of location represent other positions in the distribution. 
Quartiles — In human resources management, we often use quartiles. 

— Py, P59, and P;, are the upper limits of the 25'", 50, and 75" percentiles. 

— Q,, Q,, and Q, are the upper limits of the 1%‘, 2"¢, and 3" quartiles. 

— Triciles, quintiles and deciles are also commonly used. 


Percentiles — Percentile bars, based on the 10%, 25th, 50th, 75t, 90" percentiles and 
the mean, provide very useful visualisations of a distribution. 


No information about variance — Measures of location offer no information about how 
varied or dispersed the data are. 
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Measures of Location (Cont'd) 


25% 


Qnd 
quartile 


3rd 
quartile 


4st 
quartile 


quartile 


P25 Pso P2. 
Median 


Measures of Location (Cont'd) 
™ Measures of location are very useful in setting pay philosophy. 
= A value that a given percent of the data is less than 
Example: The 90" percentile is that value at which we have surpassed 90% of the data. 


P,, = 75" percentile 


50" percentile (median) 


Pro 


P,, = 25'" percentile 
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MEASURES OF LOCATION — FINDING PERCENTILES 


Measures of Location — 
Finding Percentiles 


™@ Determine point that 
surpasses x% 


mg v‘h value 


m@ Excel approach 


Step 1: [(x +100) x n|+[4-(x + 100)| 


x 
a (n-1) aa | 


|O© 


Measures of Location — Finding Percentiles 


Determine point that surpasses x% — We need to determine the point at which we 
have surpassed x% of the data. 


vth value — We need to determine the vn value from the bottom of the rank-ordered list 
that corresponds to this rank-ordered position. 


Excel approach — The Excel approach to find the v'" value from the bottom is as 
follows: 


v = [(x + 100) x n] + [1 — (x + 100)] 
Where: 
x = the desired percentile (25t", 50*, etc.) 


n = the number of cases in the list 
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MEASURES OF LOCATION — FINDING PERCENTILES 


Example — 
Finding Percentiles 


To find P,, (the 90" percentile) corresponding 
to a list of 20 values, we need to locate the v‘" value 
from the bottom as follows: 

v = [(x +100) x n] + [1 —(x + 100)] 


= [.90 x 20] + [1 — (90 = 100)] = 18.0 + 0.10 = 18.1 


Example — Finding Percentiles 


We need to find the 18.1 value from the bottom. 
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MEASURES OF LOCATION — FINDING PERCENTILES 


ae 
18 me 
2 
1 
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MEASURES OF LOCATION — FINDING PERCENTILES 


Finding the 90‘ Percentile 
| Bonus Rank | 

Points Order . 

= 59 19 
»57 18 v = [(x + 100) x n] + [1 — (x + 100)] 
49 4 
=o = [(90 = 100) x 20] + [1 — (90 + 100)] 
45 15 | = 
45 18 =18+.10 
38 16 = 18.1 
37 12 
32 | Find the 18.1 value from the bottom 
25 9 
24 8 —— || 
23 7 
22 6 
20 5 
18 4 
17 3 
15 2 

9 1 
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MEASURES OF LOCATION — FINDING PERCENTILES 
— 


Finding the 90* Percentile (Cont'd) 


= Determine the value that 
corresponds to the rank-ordered 
position 


— Often contains a remainder 
— Often falls between two values 


— Interpolate 


Finding the 90 Percentile (Cont'd) 
™ Determine the value that corresponds to the rank-ordered position 


— Often contains a remainder — The v‘h value often contains a remainder rather 
than being a whole number (e.g., the 18.1 value). 


— Often falls between two values — The v‘h value often falls between two 
observed values in the list. This value will often be larger than one value but 
smaller than the next rank-ordered value. 


— Interpolate — We interpolate to determine an acceptable approximation. 
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MEASURES OF LOCATION — FINDING PERCENTILES 


Interpolation 


= Percentile value 
@ Find the difference 


@ Multiply the difference by the 
decimal remainder 


@ Add product to smaller value 


Interpolation 


Percentile value — The percentile value will be at least as big as the lower of the two 
values. 


Find the difference — Begin with the lower value, then find the difference between the 
lower and upper value. 


Multiply the difference by the decimal remainder — Multiply that difference by the 
decimal remainder needed. 


— For example, if the v'" value is determined to be 18.1, the lower value would 
correspond to the 18* data point. 


— The upper value would correspond to the 19" data point. 


— Subtract the number associated with the lower value from the number associated 
with the upper value. 


— Multiply this difference by the decimal remainder. In this case, the decimal 
remainder for 18.1 would be .1. 


Add product to smaller value — Add this product to the smaller value. 


— This yields the percentile point value. 
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MEASURES OF LOCATION — FINDING PERCENTILES 


Interpolation (Cont'd) 

Bonus Rank 
Sas Order___ Step 2: Calculate the Percentile 
259 19 
— a 18 Percentile value = Lower + [(Higher — Lower) 

47 16 x decimal remainder] 

45 15 

39 14 | = — 

3 “ 57 + [(59 — 57) x .1] 

2 | = 57 + (2.1) 

28 10 | 

25 9 = 57.2 

24 8 

23 7 

22 6 

20 5 Thus, the 90‘ percentile point 

18 ; for this example is 57.2. 

15 2 

9 1 
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Percentile Bars 
= A tool to help visualize fe | 
the distribution of data, 57.20 (Peo) 
50 
™ Based upon P.,, P.;, i 45.50 (Ps) 
P.5, P75, Po, and the 40 
mean (x) 35 33.35 
- 30.00 (Ps 
™ Can indicate the shape is sia 
of the distribution 20 anon Pas) 
- 16.80 (P49) 
oo —<—<——=- 
i 
: i 


Percentile Bars 
m In this example, the mean is bigger than the median. 


m The distance between the 25th percentile and the 50th percentile is smaller than the 
distance between the 50th percentile and the 75th percentile. 


m These relationships among the various measures of central tendency or location will be 
helpful when examining the symmetry or skewness of a distribution, as will be 
discussed in Module 8. 
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PERCENTILE BARS 


2,200 
2,100 


Monthly Salary 


10 


Using Percentile Bars with I-Bars 
KLS Salary Structure and Market Salary Data 


KLS Company Data ==> 


Market Data Na 


32 
| | 95 | 
| 1,011 
@) 
43 
2,139 Data as of 7/1 


Using Percentile Bars 


This is the combination of market salary data 


and KLS salary structure data. With the Parceniile Bare fée 
whole company’s pay structure on one chart, Survey of Companies 
it is helpful to summarise all the data. 90% percentile 


The percentile bars represent the external 


75 percentile 


market. | 50" percentile 
(median) 

The I-bars represent the internal structure. 25t percentile 

Compa-ratios and market indices can be | 40% percentile 


easily determined. 


No. People 
Compared = 99 


I-Bars for 
KLS Company 
Max. of range 


Mid. of range 
*) Mean of salaries 
Min. of range 


No. People 
Compared = 10 


What can be said about the competitiveness 


of KLS salaries and structure? 
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Can You... 


= Identify and define measures of 
central tendency, including mode, 
median and mean? 


@ Identify and define measures of 
location, including percentiles? 


™@ Discuss percentile bars? 
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Module 6 Quiz 


1. What is the mode for the following set of numbers? 
15, 18, 13, 19, 25, 24, 26, 14, 25, 35, 12, 99, 54, 13, 22, 19 and 13 


A. 
B. 26 
C. 

D. 13 


19 


24 


2. What is the median of the following data? 
Data: 7, 17, 3, 19, 13, 9, 7 


A. 
B. 
C. 
D. 


4 
9 
13 
11 


3. What is interpolation? 


A. 


A value that a given percent of the data is less than 


B. An exact value representing a remainder 
C. 
D 


. A statistical representation of standard deviation 


A whole number that represents the sum of a data set 


4. What are percentile bars? 


A. 
B. The value that occurs most often among a data set 

C. 

D. The most common method of representing unweighted means 


A tool to help visualise the distribution of data 


Representations of the median salary in an organisation's pay grades 
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(blank) 
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Module 7 


Measures of Variability 


WorldatWork B 
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Module 7: Measures of Variability 


Introduction 


Measures of central tendency do not give the complete picture of a distribution. 
Distributions can have equal means, modes and medians, but still be quite different. 
Variability represents the differences among data points, which indicate how subjects 
vary. This information is important from both a statistical and a practical perspective. In 
this module, measures of variability are reviewed. We will also address z-scores and the 
application of 2-sigma and 3-sigma rules to accurately interpret data. 


Objectives 


By the conclusion of this module, you will be able to accomplish the following: 


1. Define measures of variability, including range, interquartile range and standard 
deviation. 


2. Examine z-scores and apply 2-sigma and 3-sigma rules in order to reach accurate 
conclusions about data. 
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Measures of Variability 


= Information about dispersion 
or spread 


@ Relative perspective 


@ Are observations different 
from the mean? 


Measures of Variability 


The measures of central tendency and measures of location discussed in the last 
module provide information about locations along the distribution. The measures of 
variability are equally important, since they provide information about how the values 


within a distribution vary. 

= Information about dispersion or spread — A measure of variability provides 
information regarding the amount of dispersion or spread, that exists within a set of 
data. 

™ Relative perspective — If we know a distribution’s variability, we can put individual 
observations into relative perspective. 

= Are observations different from the mean? — Measures of variability serve as tools 
for understanding not just whether observations are different from some designated 
location, such as the mean, but how much they vary. In other words, they provide a 
benchmark for determining how unusual a value is. 
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MEASURES OF VARIABILITY 


Three Key Measures 
of Variability 


m@ Range 
@ Interquartile range 


m@ Standard deviation 


Three Key Measures of Variability 


m™ Range -— (1) for a set of data, the difference between the maximum value and the 
minimum value; (2) for a pay grade, the percentage by which the maximum pay 
exceeds the minimum. 

m™ Interquartile range — the difference between the 25th percentile (first quartile) and the 
75th percentile (third quartile) in an ordered array of data. This range contains the 
middle 50 percent of the data. 


™ Standard deviation — the square root of the average squared difference between data 
points and the mean. Standard deviation is a measure of variability which indicates an 
average relative distance between each data point and the mean. The larger the 
standard deviation, the more the data is spread out from the mean. Mathematically, it is 


the square root of the variance. 
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MEASURES OF VARIABILITY 


Variability 


m@ You have surveyed 
20 companies in each of 
two cities regarding salary 
trends in the upcoming year. 


m@ The mean for each city is 7%. 


= However, the variation in each 
city is different. 


What do you recommend for City A? Why? 
What do you recommend for City B? Why? 


1 
; 
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Range 


m@ Easiest to understand 
@ Easily calculated 
# At least ordinal level data 


m@ Distance between highest 
and lowest points 


| 


Range = Highest Value — Lowest Value 


Range 


Easiest to understand — The range is our easiest measure of variability to 
understand. 


Easily calculated — The range is easily calculated. 


At least ordinal level data — The range is appropriate for data that satisfy at least the 
ordinal level of measurement. 


Distance between highest and lowest points — The range is a measure of variability 
that tells us the distance between the highest and lowest data points in our distribution. 
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MEASURES OF VARIABILITY: RANGE 


Find the Range 


X-ray Rank Order X-ray Rank Order 


Cost Position Cost Position 
500 23 420 11 
480 22 417 10 
460 21 416 9 
453 20 414 8 
452 19 410 7 
452 18 410 6 
451 17 405 5 
450 16 397 4 
450 15 380 3 
432 14 360 2 
421 13 332 1 
420 12 

| Range = 500 — 332 = 168 | V\ 


Find the Range 


Find the range for the costs of X-rays. 


Why would knowing the range be important 


in remuneration and benefits? 
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MEASURES OF VARIABILITY: RANGE 


What Is the Interquartile Range? 


The interquartile range is a measure of 
variability based on the distance between the 
75th and 25‘ percentiles for our distribution. 


| Interquartile Range = P-_,— P,; 
=Q;- Q, 
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MEASURES OF VARIABILITY: RANGE 


Find the Interquartile Range 
| Xray -RankOrder | X-ray Rank Order — 

Cost Position Cost Position _ 
500 23 420 11 

480 22 417 10 

460 21 416 9 

453 20 414 8 

452 19 410 7 

452 18 410 6 

451 17 405 5 

450 16 397 4 

450 15 380 3 

432 14 360 2 

421 13 332 1 

420 12 


I 


Find the Interquartile Range 


Find the interquartile range for the X-rays. 


Step 1: v=[(x +100) xn] +[1—-—(x+100)] 
e For the 25" percentile: v = [(.25 x 23) + (.75)] =6.5 
e For the 75 percentile: v = [(.75 x 23) + (.25)] = 17.5 


Step 2: Percentile value = Lower + [(Higher — Lower) x Decimal Remainder] 
e For the 25 percentile: 410 + [(410 — 410) x .5] = 410.0 
e For the 75** percentile: 451 + [(452 — 451) x .5] = 451.5 
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MEASURES OF VARIABILITY: RANGE 


Answer - Find the Interquartile Range 


| Interquartile Range = P_, — P, | 
451.50 — 410 
41.50 


The value that corresponds to the 
17.5 position from the bottom is 451.50. 


The value that corresponds to the 
6.5t4 position from the bottom is 410. 


Answer - Find the Interquartile Range 
Another way of expressing this value is: 
Interquartile Range = Q,-Q, 

= 451.50 -— 410.00 

= 41.50 
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MEASURES OF VARIABILITY: RANGE 


Use of the Range 
and the Interquartile Range 


Disregarding job content, 
which job would you use 
from this survey for a 
benchmark job? 


Does knowing the 
variation in each job 
influence your decision? 
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MEASURES OF VARIABILITY: RANGE 


Answer — Use of the Range 
and the Interquartile Range 


Interquartile Range 


Based on the survey data, the means and the 
medians are equal. However, there is less variation in 
the auditor data than in the senior accountant data. 


All other factors being equal, the auditor job would be 
a better benchmark job. 
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MEASURES OF VARIABILITY: RANGE 


Remember... 


m™ The range is based on two values 


™ Interquartile range based on 
middle 50% of data 


@ Is there a better measure? 


Remember... 


The range is based on two values — The range is based on only two values. It does 
not reflect the variation of the values between these two points. 


Interquartile range based on middle 50% of data — The interquartile range is based 
on only the middle 50% of the data. It does not reflect the variation of the full set of data 
either. 


Is there a better measure? — We need a better measure...one that is based on all of 
the values in our data set. 


— Weneed to consider the standard deviation. 
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Standard Deviation 


A measure of variability that indicates 
an average relative distance between 
each data point and the mean 


™@ Uses all data points 
™ Most useful measure 
m@ Standard output 


# Definition 


Standard Deviation 


Uses all data points — The standard deviation uses all the data points from the data 
set in its calculation. 


Most useful measure — The standard deviation is the most useful measure of 
variability in statistics. 


Standard output — The standard deviation is standard output on spreadsheets and 
statistical calculators. 


Definition — The standard deviation is the “square root of the average of the squared 
deviations from the mean.” 


—  |f the standard deviation is relatively small, the data points tend to be relatively 
close to the mean. 


— If the standard deviation is relatively large, the data points tend to be relatively 
far away from the mean. 


GR2 w 7.14 


© WorldatWork. All rights reserved. 


MEASURES OF VARIABILITY: STANDARD DEVIATION 


Applications of 
the Standard Deviation 
= Identify outliers 
= Compare variation in data sets 
m@ Create confidence intervals 


= Compute z-scores 
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MEASURES OF VARIABILITY: STANDARD DEVIATION 


Example — Standard Deviation 


@ Three remuneration analysts vary in their months of 
remuneration experience: 


72, 62, and 61 months 
= What is the mean number of months of experience? 


@ Do they vary much in their experience? 


Example — Standard Deviation 


On average, they deviate zero amount from each other: 


(x—-x) 0 
3 


variation = =0 


But they DO vary. The positives and negatives balance each other out, as they always will 
when subtracting the mean from all values. 


How can we resolve this problem? 
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MEASURES OF VARIABILITY: STANDARD DEVIATION 


Example — Standard Deviation 
(Cont'd) 


Squared 
Deviations 
from the Mean 


Deviations | Deviations 


Months of 
Experience 


Example — Standard Deviation (Cont'd) 


If we square the differences of each of the values from the mean, all the resulting values 
will become positive. If we sum these squared differences and then divide that amount by 
the number of cases, we will have an average of the squared deviations. Let's try this 
approach: 


s\2 
variation = Be ave ye 25 


3 


But these three individuals certainly do not vary by approximately 25 months on the 
average. No two people are that far apart! 


How can we resolve this problem? 
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MEASURES OF VARIABILITY: STANDARD DEVIATION 


Example — Standard Deviation 
(Cont'd) 


Example — Standard Deviation (Cont'd) 


In this approach, we squared the differences of each value from the mean, added, and 
found the average. This value is called the “variance.” However, the variance has little 
practical value to us as practitioners. 


We need to convert the value back to the original unit of measure. That is, we need to take 
the square root of the variance. Let’s try this approach: 


_ Yy\2 
Standard deviation = ye AL a a =J25=5 


The three individuals do vary from each other on average by about 5 months. 
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MEASURES OF VARIABILITY: STANDARD DEVIATION 


Population Standard Deviations 
vs. 
Sample Standard Deviations 


Population Sample 
ee EIX-X): — (x - xX} 
N N-1 


Population Standard Deviations vs. Sample Standard Deviations 
How to Calculate the Standard Deviation 


Statistic symbols 


N = number of data points > = sumof 
X = the data values 0, = standard deviation of the population 
X = the mean of alln data s = standard deviation of the sample 


points (x-bar) 
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MEASURES OF VARIABILITY: STANDARD DEVIATION 


Calculation of the Standard Deviation 


Example 
Three engineers have the following lengths of service: 1, 3 and 8 years. 


Procedure Press Display 
1. Change format to 1 decimal 2nd Format DEC = (Old contents) 
point (arbitrary decision). 
DEC = 1.0 
2. Select data worksheet. 2nd Data X01 = (Old contents) 


3. Clear worksheet. 


4. Enter data. 


a) Enter 1st data value. 1 Enter X01 = 1.0 


b) Enter 2"¢ data point. 3 Enter X02 = 3.0 
Move down 2 displays. X03 = 0.0 
c) Enter 3% data point. 8 Enter X03 = 8.0 


5. Select statistical calculation (Old contents) 


portion of statistics worksheet. 


6. Make sure you are in 1-V 2nd Set 1-V 
mode. 
7. View calculations. [ wv | n= 3.00 | 
¥ X= 4.0 
Sample standard deviation. S, = 3.6 
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MEASURES OF VARIABILITY: STANDARD DEVIATION 


Example — Sample Data 
for Days Absent on the Job 


Mean 
Sample Standard Deviation 


Example — Sample Data for Days Absent on the Job 


We have collected sample data for employees from two different locations based 
upon the number of days absent under short-term disability within the last year. 
We have determined the mean and standard deviation for London. 


We now want to determine the mean and standard deviation for New York. 
(Use two decimal places.) 


m@ What is the mean for New York? 


m What is the sample standard deviation for New York? 
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MEASURES OF VARIABILITY: STANDARD DEVIATION 


Answers — Sample Data 
for Days Absent on the Job 


Mean 
Sample Standard Deviation 


Answers — Sample Data for Days Absent on the Job 


Notice that the standard deviation for London is about three times the size of the standard 
deviation for New York. This is a strong indicator that there is much more variation in 
absenteeism for the sample of employees in London than for the sample of employees in 
New York. 
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MEASURES OF VARIABILITY: STANDARD DEVIATION 


Comparing Means and Standard 
Deviations for Two Samples 


i€ Mean = 13.92 


6 7 8 9 10 11 12 13 4 15 16 17 18 19 20 21 22 


6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 


Comparing Means and Standard Deviations for Two Samples 
Discussion 


If you learned that an employee working in London has 17 days of short-term disability, 
would this person be high relative to the other London employees? 


What about an employee working in New York who has 17 days of short-term disability? 


m™ Seventeen days in London would be high but not the highest, relative to the other 
employees in London. 


m™ Seventeen days in New York would be unusually high, relative to the other employees 
in New York. 
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MEASURES OF VARIABILITY: STANDARD DEVIATION 


The 2-Sigma Rule for 
Standard Deviations 


Example for London 


2%* 3.29 = 6.58 


| 13.92 - 6.58 = 7.34 


| 13.92 + 6.58 = 20.50 
7.34 |13.92) 20.50 


At least 75% of the data points will fall between these points: 


| 7.34 | to | 20.50 


The 2-Sigma Rule for Standard Deviations 


At least 75% of the points in a data set will always be within + 2 standard deviations from 
the mean. This rule applies to any distribution. 


What percent of the data points in this example actually fall within + 2 standard deviations 
from the mean? 


GR2 @ 7.24 


© WorldatWork. All rights reserved. 


MEASURES OF VARIABILITY: STANDARD DEVIATION 


The 3-Sigma Rule for 
Standard Deviations 


Example for London 


3 * 3.29 = 9.87 


| 13.92 ~9.87 = 4.05 


| 13.92 + 9.87 = 23.79 


4.05 (13.92) 23.79 


At least 88.9% of the data points will fall between these points: 


|4.05|to 23.79, 


The 3-Sigma Rule for Standard Deviations 


At least 88.9% of the points in a data set will always be within + 3 standard deviations from 
the mean. 
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MEASURES OF VARIABILITY: STANDARD DEVIATION 


The Standard Deviation as a Powerful Tool 
for Recognising Potential Discrimination 


Senior Clerks 


1900 
X 
Monthly Pay 


m 


The Standard Deviation as a Powerful Tool for Recognising 
Potential Discrimination 


Female Male 
Mean 1,775 1,900 
Standard deviation 18.20 44.20 
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Standard Scores (z-Scores) 


Definition: 


The distance of a value from the mean. 


Example: 


If a female is paid the equivalent of the average 
male (1,900) in Company A, what is her z-score? 


_ 1900-1775 _ g 97 
18.2 


z 


Standard Scores (z-Scores) 
™ Wecan determine whether a value is an outlier by making it relative to two statistics: 
the mean and the standard deviation. 


m™ Astandard score or z-score can be either positive or negative. A value that falls above 
the mean will have a corresponding positive z-score. A value that falls below the mean 
will have a corresponding negative z-score. In the unlikely event that a value falls 
exactly at the mean, it will have a corresponding z-score equal to zero. 


What does the standard score tell you? 


m™ fa female is paid equivalent to the average male (1,900), she would be 6.87 standard 
deviations above the average (1,775) pay for the female. Thus, such a female would 
have an extremely high pay relative to her comparison group. 


m In fact, a female earning 1,829.60 or greater would be very unusual, since according to 
the 3-sigma rule, we would expect at least 88.9% of the females to receive pay 
somewhere between 1,720.40 and 1,829.60, based on 1,775 + (3 x 18.2) 


m™ In these cases, the z-score helps us focus our attention on values that deserve 
analysis. 
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STANDARD SCORES 


Example- - (SE oe 
Applications 

of 2-Sigma, 
3-Sigma and 
z-Scores 


= =.= = 
aan 
= 
Ls) 


= = 
o 
oa aoanrhoon o 


Frequency distribution 
for the number of new 
customers acquired by 
a sample of 50 sales 
representatives 
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12 

11 
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2 
1 
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Are there any outliers? 


Example — Applications of 2-Sigma, 3-Sigma and z-Scores 
m Is 17 an outlier? 

m Is 15 an outlier? 

m Is 1 an outlier? 


Let’s create a bar graph to better visualise the distribution. 
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STANDARD SCORES 


Relative Frequencies (%) 


Identifying Outliers 


Bar Graph of New Customers Acquired for 
Sales Staff Using Relative Frequencies 


Is this an 
outlier? 


Is this an 
outlier? 


Is this an 
outlier? 


= 
— 
5 


= 
Nn 
4 


J 
1 


64 


4 323) 54 55°67 <8. 9. 10-14 12: 13° 14.15: 16:17 
Number of New Customers Acquired 
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STANDARD SCORES 


Determining and Storing the Mean 
and the Standard Deviation 


© Begin Continue 
PRESS: DISPLAY: PRESS: DISPLAY: 

2nd | Data > | X04 = (Old Contents) | iy | Y03 = 1.00 

2nd ClirWork X01= 0.00 2 | | Enter | | Y03= 2.00 
17 Enter | | X01= 17.00 | (Continue entering the grouped data 
v Y01 = 1.00 into the X and Y columns.) 
1 || Enter | [Y01= 1.00 2nd Stat 1-V 
v X02 = 0.00 iv | n= 50.00 
15 Enter X02= 15.00 ¥ X= 7.06 
v Y02 = 1.00 a0, 7 7.06 
1 Enter Y02= 1.00 Ly | Sx= 3.29 
v X03 = 0.00 | STO] |_8 3.29 
12 | { Enter | [X03 = 12.00 | i 


Determining and Storing the Mean and the Standard Deviation 


™ Notice that we do not input X02 = 16 and Y02 = 0, X04 = 14 and Y04 = 0, etc. When 
we are working with only one variable with this calculator, we cannot input zero values 
for the y-values. If we did, we would get an error message in the 1-V mode. 


m™ Therefore, we simply omit the inputting of any values that have zero frequencies when 
using this calculator. The easiest way is to draw a line through such values before we 
begin the inputting of data. 


™ Notice that we have stored the mean and the standard deviation into memory locations 
7 and 8 for easy recall later. 
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STANDARD SCORES 


Developing 2-Sigma and 
3-Sigma Boundaries 
Recalling and Using the Stored Values 


Ree 7 = ( Bee 4 2 2 5 = a 
RCL 7 + «(2 REL "8 “x 2 “j “= 4364 
[Reu|| 7 | - || ¢ |[Re]| 8 || = || 3 ||) || = |[-2.84] 
[RCL|| 7 || + |[ ( |/Rev]| 8 || x |/ 3 || ) || = |/ 16.93) 


Developing 2-Sigma and 3-Sigma Boundaries 
2-sigma boundaries: 0.48 to 13.64 
3-sigma boundaries: —2.81 to 16.93 
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STANDARD SCORES 


Conclusions 


Bar Graph of New Customers Acquired for 
Sales Staff Using Relative Frequencies 


This definitely 
appears to be 
an outlier. 


appears to be 
an outlier. 


Relative Frequencies (%) 


T ; TNT A 
9 10 11 12 13 :14 15 16 
Number of New Customers Acquired ~ 


Conclusions 


m™ Acquiring 15 or more new customers is unusual, as these fall outside the 2-sigma 
boundary (no more than 25% will fall outside the 0.48 to 13.64 boundary). 


— We should red-circle these for further examination. 


™ Acquiring 17 or more new customers is very unusual, as this falls outside the 3-sigma 
boundary (no more than 11.1% will fall outside the —2.81 to 16.93 boundary). 


— We should double red-circle these for serious review. 


™ Note that when using the 3-sigma rule, the lower boundary is less than zero. This is not 
uncommon when applying the rule to real world situations, so you should not become 
alarmed. Of course, no one will have a negative value for the number of new 
customers acquired. However, the rule still holds. You will notice that at least 88.9% of 
the cases will fall between —2.81 and 16.93. In reality, at least 88.9% of the cases fall 
between 0 and 16.93. 
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STANDARD SCORES 


Applications of 2- and 3-Sigma Rules — 
Monitoring Employee Production Levels 
Over 30 Days 


Run Chart Plotting Daily Production 


+3 SD 
+2 SD 
+1 SD 
Mean 
-1SD 
-2SD 
-3SD 


Number of Parts Produced 


T r T T T a re es T T T T T T T T T T T T cle T T T T T 
23.4 5 6 7 8B 9 10 1112 13 1415 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 
Days of the Month 


Applications of 2- and 3-Sigma Rules 


We often create run or control charts to help identify outliers. 


The data must consist of a series of observations or groups of observations over time 
(such as hours, days of the month, years, etc.). 


When there is only one observation per time period, they are referred to as “run charts 


or “individual charts.” 


The graph above includes the following lines and data points: 


The series of data points (daily production rates for each of 30 days) 
The centre line (based on the mean) 

Lines corresponding to 1 standard deviation above and below the mean 
Lines corresponding to 2 standard deviation above and below the mean 
Lines corresponding to 3 standard deviation above and below the mean 


Notice that any observations falling outside the 2-sigma zone have been red-circled. 


Notice that any observations falling outside the 3-sigma zone have been double 
red-circled. 


In these cases, the z-score helps us focus 


our attention on values that deserve analysis. 
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STANDARD SCORES 


More Conclusions: 
Applications of 2-Sigma, 3-Sigma 
and z-Scores 


Notice that finding the 
z-score associated 
with acquiring 17 new 
customers leads to 
the same conclusion 
as using the 2- or 
3-sigma rule: 

it is an outlier. 


More Conclusions: Applications of 2-Sigma, 3-Sigma and z-Scores 


The value 17 converts to a z-score of 3.02, which is more than 3 standard deviations from 
the mean. This is a strong suggestion that 17 is unusual. 
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Can You... 


= Define measures of variability, 
including range, interquartile range 
and standard deviation? 


m™ Examine z-scores and apply 2-sigma 
and 3-sigma rules in order to reach 
accurate conclusions about data? 
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Module 7 Quiz 


1. A range can best be described as which of the following? 


A. 
B. 
C. 


D. 


A measure of central tendency 
The difference between the maximum value and the minimum value in a data set 


A tool used to measure the variation of a data set based on the middle 
50 percent of the cases 


The only measure of variability appropriate for ratio data 


Which of the following is a characteristic of a standard deviation? 
A. 
B. The standard deviation reflects the variation of values between two points. 

C. 

D. At least 95% of the points in a data set will always be within +2 standard deviations 


The standard deviation is based on the middle portion of the data. 


The standard deviation uses all the data points from the data set in its calculation. 


from the mean. 


Suppose your monthly pay is 3,000. The mean for all employees in your grade is 


3,500, and the standard deviation is 200. What is your z-score relative to this group of 
employees? 


A. 
B. 
C. 


D 


3.0 
2.5 
—2.5 
-3.0 


According to the 2-sigma rule, which of the following is true? 


A. 


Ninety-five percent of the data points will fall within two standard deviations of the 
mean. 


Only 75 percent of the data points will fall within two standard deviations of the 
mean. 


At least 75 percent of the data points will fall within two standard deviations of the 
mean. 


The 2-sigma rule only applies to nominal level data. 
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Module 7 Quiz (Cont'd) 


5. The average seniority for your company is 16 years, and the standard deviation for 
seniority is one year. You have been with the company for 19 years, and your sister 
has been with the company for 13 years. Which of the following is not necessarily 
true? 


A. Your “years of seniority” value is three standard deviations above the average 
employee in your company. 


B. Your sister has a z-score of —3.00 in regards to her seniority with the company. 


C. The seniority for at least 88.9 percent of the employees in this company will fall 
between your sister’s seniority and your seniority. 


D. You are the most senior employee in your company. 
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(blank) 
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Module 8 


Statistics: 
Shapes of Distributions 


WorldatWork AB 
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Module 8: Statistics: Shapes of Distributions 


Introduction 


In previous modules, we addressed numerical approaches to understanding data. In this 
module, we will look at visual methods of representing data: graphs and distribution 
shapes. 


Objectives 


By the conclusion of this module, you will be able to accomplish the following: 
1. Identify and interpret various shapes of distributions. 


2. Identify characteristics of a normal distribution. 
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Shapes of Distributions 


Flat Symmetric Approximately 
but not symmetric and flat bimodal 
TI) LECCE alll... 


Symmetric and 
“bell” shaped 


Skewed with a Skewed with a 
tail to the left tail to the right 


Shapes of Distributions 


Distributions can take many shapes. We can better understand relationships among 
data by visualising their distribution. Distribution shapes give a visual representation of 
how the distribution is “centred” and how the data points are dispersed. 


Questions to ask: 

m™ Is the shape symmetrical? (Is one side a mirror image of the other side?) 

m@ Is it tall and skinny? 

m@ ls it short and squatty? 

m Is it bell-shaped? 

If it is not symmetrical, then it is skewed (with a tail trailing off in one direction). 

The visual characteristics of distributions have corresponding mathematical values. 


lf the distribution is perfectly symmetric with one and only one mode, the mean, median, 
and mode will be equal. If the distribution is perfectly symmetric and there is more than one 
mode (e.g., bimodal), the mean and median will be equal to each other but not to the 
mode. 


A flat distribution has no mode but is not necessarily symmetric. 


If mean, median and mode are equal, this does not mean the distribution will necessarily 
be normal. 
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SHAPES OF DISTRIBUTIONS 


Example — Survey Data 


Bar Graph of Accountant Salaries 
from Survey Data 
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Weekly Salaries 


Example — Survey Data 


In this example, there are two different sets of data to be compared. The current graph 
represents survey data. We should not be surprised to find that it has considerable 
variation and is somewhat skewed. Notice the positional relationship of the mean to the 
bulk of the data. 
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SHAPES OF DISTRIBUTIONS 


Example — Company Data 


Bar Graph of Accountant Salaries 
from KLS Company 
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Weekly Salaries 


Example — Company Data 


This graph represents a company’s data and can be compared to the previous graph taken 
from survey data. We should not be surprised to find that it has considerably less variation 
than the survey data. In this case, it is also much more symmetric. However, we do not 
want to conclude that it is “normal” (bell-shaped). 
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SHAPES OF DISTRIBUTIONS 


Example — Both Data Sets 


Bar Graph Comparing 
Survey Data to KLS Data 


Survey Mean —} i+ Company Mean 


Relative Frequencies (%) 


/ oll a 
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549 «599 649 699 749 799 849 899 949 999 1049 1099 1149 


Weekly Salaries 


Example — Both Data Sets 


It is not unusual for a picture to look like this, where the survey data are wider and higher 
and your own company data are closer together. 


How does the distribution of pay for the 


KLS company compare to the survey data? 
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SHAPES OF DISTRIBUTIONS 


Other Examples of Shapes of 
Distributions — Performance Ratings 


$68 


Absolute Frequencies 
Absolute Frequencies 


123 45 6 7 8 9 10 
Rating 


5.8 7 8 9 10 
Rating 


42 2 4 


Operations Marketing 


Absolute Frequencies 
868 8 


= 
—} 


8 
: 
£ 
3 
: 


4723 465 6 7 68 123 4 5 6 7 & 9 10 
Rating Rating 


Other Examples of Shapes of Distributions — Performance Ratings 


Distributions can take on different shapes, and we can better understand the 
characteristics of the data based on their shapes. 


Notice that marketing is “bimodal.” While the two frequencies are not identical, for all 
practical purposes we should recognise that there are two “modal” categories of ratings. If 
we had used Excel’s built-in function for the mode and had not graphed the data, we would 
have concluded that the mode is 7 and would not have recognised the bimodality. 
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SHAPES OF DISTRIBUTIONS 


Characteristics of Nonsymmetric 
(Skewed) Distributions 


Skewed with a 
tail to the right 
(positive skewness) 


Relative Frequencies (%) 


The Variable of Interest 


Characteristics of Nonsymmetric (Skewed) Distributions 


A nonsymmetric distribution has one side not balanced against the other side. Thus, the 
distribution is said to be skewed. Distributions may be skewed with a tail to the left or 
skewed with a tail to the right. The salary distributions for most organisations are skewed 
to the right. 

Generally speaking, the mode, median and mean follow a particular order in skewed 
distributions. In distributions skewed with a tail to the right, the mean typically has a bigger 
numerical value than the median, which in turn has a bigger numerical value than the 
mode. We refer to this as a positively skewed distribution. In such distributions, the most 


extreme “outliers” typically have positive z-scores. 


m Examples 


The number of factors typically used when making executive remuneration 
decisions by organisations is generally positively skewed. Some organisations 
might use only one factor, while others might use two, three, four, five, or six 
factors; in fact, some organisations might consider seven, eight, or more factors. 
The distribution of ages for employees in the IT sector is generally skewed with a 
tail to the right. 

The distribution of test scores for a set of students who have not studied at all for 
an exam is generally positively skewed. 

The distribution of pay for major league baseball players is typically skewed with 
a tail to the right. 
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SHAPES OF DISTRIBUTIONS 


Characteristics of Nonsymmetric 
(Skewed) Distributions 


Mode 


Median 


Mean ; Bs 
MALL Ff) | 


Skewed with a 
tail to the left 
(negative skewness) 


#3 
& 
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& 
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The Variable of int 


Characteristics of Nonsymmetric (Skewed) Distributions 


In distributions skewed with a tail to the left, the mean has a smaller numerical value than 
the median, which in turn has a smaller numerical value than the mode. We refer to this as 
a negatively skewed distribution. In such distributions, the most extreme “outliers” typically 


have negative z-scores. 
m Example 


— The distribution of test results for a group of participants enrolled in a T3 seminar 
(or other WorldatWork certification classes), after they have completed the 
seminar, is typically negatively skewed, with far more individuals earning high 


scores than individuals earning low scores. 
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The Normal Distribution 


Mean 
Median 
Mode 
> 
o 
< 
® 
=} 
og 
2 
LL 
-3 —2 -1 0 +1 +2 +3 
+1 Standard >| 
Deviation 


(includes 68% of the data) 


je +2 Standard Deviations >| 
(includes 95.45% of the data) 


+3 Standard Deviations 
(includes 99.73% of the data) 


The Normal Distribution 

A normal distribution (often referred to as a “bell-shaped” curve) is a distribution that often 
occurs in naturally occurring phenomena. Heights and weights of humans are generally 
normally distributed. The yields of corn are often normal. The diameters of apples are 
generally normally distributed. Pay is usually not normally distributed, nor is performance. 
Do we want performance to be normally distributed? 

Notice that the area between —2 standard deviations and +2 standard deviations is now 
95.49%. When we have a perfectly normal distribution, the area between —2 and +2 
standard deviations increases from “at least 75%” to “precisely 95.45%.” 
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THE NORMAL DISTRIBUTION 


Summarising the 
Normal Distribution 


m@ Approximates the distribution of 
naturally occurring phenomena 


= An essential tool sometimes misused 


@ Not perfectly normal 


Summarising the Normal Distribution 


= Approximates the distribution of naturally occurring phenomena — Most data, 
including remuneration data, are not perfectly normal. We sometimes hope that our 
data are approximately normal, but this should not be our goal. 


= Anessential tool sometimes misused — The normal distribution is an essential tool 
for inferential statistics that is sometimes misused to force a desired outcome. 


= Not perfectly normal — Most data, including remuneration and benefits data, are not 
perfectly normal. 


The normal distribution does not apply to things 


that do not naturally occur. For example, 
performance ratings would not apply. 
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Percentile Bars 


Percentile bars visually represent distributions 
Skewed with a tail to the right 
Mean to the right of the median 


Distance between P_, and P,, greater than 
distance between P,,, and P., 
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Bonus Points for Madrid Field Representatives 
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Can You... 
m@ Identify and interpret various 
shapes of distributions? 


@ Identify characteristics 
of a normal distribution? 
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Module 8 Quiz 


1. Adistribution that is skewed with a tail to the left is which of the following? 


A. Positively skewed 
B. Symmetric 
C. Negatively skewed 


D. Normal 


2. In positively skewed distributions, the mean is usually which of the following? 


A. Larger than the median 
B. Between the median and the mode 
C. Smaller than the median 


D. Equal to the median 


3. Which of the following is characteristic of a normal distribution? 


A. lt represents a perfect distribution of data. 
B. It generally is skewed with a tail to the right. 
C. It is best used to force a desired outcome. 


D. It approximates the distribution of naturally occurring phenomena. 
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Module 9 


Regression Analysis 


WorldatWork ZB 
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Module 9: Regression Analysis 


Introduction 


Models help us solve problems by clarifying relationships between data sets. You can 
solve pay-related problems or predict pension income, but first you need to find the critical 
factors in the data and quantify the relationship. This module provides an overview of how 
to build a mathematical model and develop a two-variable model, using the tools required 
for each step. 


Objectives 


By the conclusion of this module, you will be able to accomplish the following: 
1. Identify applications of regression models in an HR environment. 

2. Develop a regression model. 

3. Identify cautions in the interpretation of correlations. 

4 


Explain why one would use a multiple regression model and identify potential 
applications. 
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Applications of 
Regression Models 
@ Establish pay models 
@ Establish guidelines 
@ Predict pension income 
@ Evaluate pay-for-performance 


@ Explain variations 


Applications of Regression Models 


Establish pay models — Establish pay models based on relationships between job 
worth and market pay. Applications may also include the setting of policies (via 
establishment of pay lines, for example). 


Establish guidelines — Establish internal pay guidelines based on job experience. 
Applications also may include auditing (pay-increase percentage in relation to 
protected classes, for example). 


Predict pension income — Predict pension income by examining retirement plan 
components and savings. Applications also may include prediction of benefit cost 
increases, etc. 


Evaluate pay-for-performance — Evaluate pay-for-performance models by examining 
relationships between pay and performance. 


Explain variations — Explain variations in, for example, salary or share performance. 
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Steps for Developing 
a Regression Model 


1. Specify issue/ problem 
2. Generate critical factors 


3. Collect data and determine the 
mathematical relationships 


4. Develop the model, quantify 
and interpret the relationship 


5. Evaluate the model 


6. Take appropriate action 


Steps for Developing a Regression Model 
1. Specify issue/problem. 


— Determine the issue or problem you are attempting to solve. This issue or 
problem becomes your y-variable, the dependent variable. 


2. Generate critical factors. 


— Which ones would be most appropriate for the issue/problem? 
These factors become your potential x-variables, the independent variables. 


3. Collect data and determine the mathematical relationships. Determine the 
mathematical relationships, if any, between the critical factors and the issue/problem. 


— Collect data. 

— Plot the data. 

— Determine the relationships. 
4. Develop the model, quantify and interpret the relationship. 

— Quantify the relationship using linear regression and interpret the results. 
5. Evaluate the model. 

—  fvaluate the strength of the linear regression model. 

— Interpret the results (coefficients, equation, r-squared). 


6. Take appropriate action. 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 1 


Step 1 — Specify Issue/Problem 


= What is the issue or problem? 


— What is the variable of interest ? 
— What is the y-variable? 
— What is the dependent variable? 


What is the criterion variable? 


Step 1 - Specify Issue/Problem 


= What is the issue or problem? Specify the issue or problem. The identification of the 
particular issue or problem defines the y-variable or the dependent variable. This is the 
variable that we are trying to explain or predict. 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 2 


Step 2 - 
Generate Critical Factors 


m Is pay a function of years of experience? 


m@ Is pay a function of organisational level? 


—™ Conclusion 


— Pay is a function of many critical factors. 


Step 2 — Generate Critical Factors 


Are there any other variables (critical factors) available or under consideration related to 
the variable of interest? 
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STEPS FOR DEVELOPING A REGRESSION MODEL - STEP 3 


Step 3 — Collect Data and Determine 
the Mathematical Relationships 


Annual Pay vs. Years of Experience 
and ag Level 


ao orgg vee ee 
Name Annual Pay arene ows 


*41 = President 


2 = Next level down 


6 = Lowest level 


Step 3 — Collect Data and Determine the Mathematical Relationships 


m Plot data. Determine if there is a relationship. (What sort of relationship do you expect 
to find?) 


@ Organisational level is defined as follows: 


1 = President 


| 


= Next level down 


= Lowest level of organisation 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 3 


Step 3 — Collect Data and Determine 
the Mathematical Relationships (Cont'd) 


Plot data points in order to: 


™ Create a visual image of the data 


Identify unusual cases (outliers) 


= 
= Identify patterns or trends 

= Examine relationships among variables 
ca 


Determine appropriate models 


Step 3 - Collect Data and Determine the Mathematical Relationships (Cont'd) 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 3 


Step 3 — Collect Data and Determine 
the Mathematical Relationships (Cont'd) 


Steps for plotting points: 
@ Identify dependent and independent variables 
™@ Decide on scaling for each variable 

— Minimum value on scale 

— Maximum value on scale 

— Increments 


@ Plot the points 
@ Present the graph 


Step 3 — Collect Data and Determine the Mathematical Relationships (Cont'd) 


Steps for plotting points: 


Identify dependent and independent variables — First, identify the dependent (y) and 
independent (x) variables. The data we collected are a set of (x, y) pairs. 


Decide on scaling for each variable — Next, decide on the scaling for each variable. 


— Minimum value on scale — Try to make sure the minimum value is lower than or 
at the lowest value for the variable. 


— Maximum value on scale — The maximum value should be higher than the 
highest value for the variable to ensure that the points fall within the graph with no 
points on the edges. 


— Increments -— Try to set logical increments. 


Plot the points — Then plot the (x, y) pairs on the graph and present the graph [without 
the (x, y) pair labels]. 


Present the graph 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 3 


Step 3 — Collect Data and Determine 
the Mathematical Relationships (Cont'd) 


Annual Pay vs. Years of Experience 


50,000 +( What might ) 
explain LU 
45,000 Brian’s 4 2 
> s 
40,0007 pay? lt = 
S 35,000 
s 30,000 « 
— 25,000 a 
© | r 
2 20,000 «= 
c 
< 15,000 ie 
10,000 " 
5,000 
ty) 
0 5 10 15 20 
Years of Experience (X) 


Step 3 - Collect Data and Determine the Mathematical Relationships (Cont'd) 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 3 


Step 3 — Collect Data and Determine 


the Mathematical Relationships (Cont'd) 
Annual Pay vs. Years of Experience 
1. Is there a relationship? 
2. Is it perfect? 
3. Are there any outliers? 
4. If there is a relationship, is the relationship 
linear? 
5. If it is linear, is it positive or negative? 
6. If it is nonlinear, what is the nonlinear pattern? 
7. Does the graph reflect what you expected? 


| 


Step 3 — Collect Data and Determine the Mathematical Relationships (Cont'd) 


After plotting the points, always ask yourself these questions in regards to a possible 
relationship between the critical factor selected (the x-variable or independent variable) 
and the issue/problem (the y-variable or dependent variable). 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 3 


Step 3 — Collect Data and Determine 
the Mathematical Relationships (Cont'd) 


Annual Pay vs. Organisational Level 


50,000 
45,000 — Does 
40,000 —————= organisational 
level impact 
= 35,000 \ Brian’s pay? 
= 30,000 & 
— 25,000 
$ 
E 20,000 
< 15,000 


10,000 
5,000 


Organizational Level (X) 


Note: Two data points fall at the coordinates of 4 and 23,000. 


Step 3 - Collect Data and Determine the Mathematical Relationships (Cont'd) 


™ How should we respond to these seven questions regarding the relationship between 
“organisational level” and “annual pay” on the previous graph? 


—™ Does organisational level help to explain why Brian’s pay appeared to be an outlier 
when looking at the graph that was based on “years of experience” and “annual pay”? 
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STEPS FOR DEVELOPING A REGRESSION MODEL - STEP 3 


Step 3 — Collect Data and Determine 
the Mathematical Relationships (Cont'd) 


What relationships are indicated by each graph? 


Step 3 — Collect Data and Determine the Mathematical Relationships (Cont'd) 


™ Apply the relationship questions to these six graphs. 


1. 


ate Se OR 


Is there a relationship? 

Is it perfect? 

Are there any outliers? 

If there is a relationship, is the relationship linear? 
If it is linear, is it positive or negative? 

If it is nonlinear, what is the nonlinear pattern? 


Does the graph reflect what you expected? 


If the plotted points had looked like these, for which 


graph(s) would a linear model appear to be useful? 
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Step 3 — Collect Data and Determine 
the mae malice! poevorap (Cont'd) 


~ Graph — 


Questions 
to Ask 
1. Relationship? Yes Yes Yes Yes Yes Yes 


2. Perfect? 


Yes 
(It would be, 
without the outlier.) 


pf eto Pm |e |e 


a a a 
NA* NA* NA* NA* NA* 


3. Outliers? 


4. Linear? 


+k 


5. Positive or Positive 


negative? 


6. Nonlinear 
model? 


7. Meets 
expectations? 
* For these examples, we will not know ** After eliminating the outlier, there is no 
what is expected since we have not variation in the x-variable. Therefore, 
identified the x-variable or the y-variable. this question would not apply (NA). 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 4 


Step 4 - 
Develop the Model 


= Develop the model, quantify and 
interpret the relationship 


— Does the model appear to be linear 
or nonlinear? 


— What mathematical equation best 
captures the data? 


Step 4 — Develop the Model 
= Develop the model, quantify and interpret the relationship. 


— Quantify the relationship using linear regression and interpret the results. 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 4 


Step 4 - 
Develop the Model (Cont'd) 
Straight Line Model 


y=atbx | 


dependent (criterion) variable 

independent (predictor) variable 
intercept 
slope 


oreo KX < 
il 


The most common model in remuneration 
is the linear (straight line) model. 


Step 4 — Develop the Model (Cont'd) 
Straight Line Model 


What is the equation for a straight line model? 


y=atbx 

where: y = dependent (criterion) variable 
x = independent (predictor) variable 
a =_ intercept 
b = slope 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 4 


Step 4 - 
Develop the Model (Cont'd) 
Straight Line Model 


The simplest mathematical model is the straight line. 


Salary 


Evaluation Points 


Step 4 — Develop the Model (Cont'd) 
Straight Line Model 
We use a mathematical model to describe the relationship between variables in a 


quantitative perspective. 
In remuneration and benefits, data are quite frequently quantitative, so mathematical 


models are quite commonly used. 


The best models are the ones 


that best capture the data with 
a minimum amount of error. 
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STEPS FOR DEVELOPING A REGRESSION MODEL - STEP 4 


Step 4 —- 
Develop the Model (Cont'd) 
Straight Line Model 


= Most common regression model 
in remuneration 


@ y=atbx 


™@ Each combination of intercept and slope 
gives a uniquely different straight line 


@ For perfect models, every (x, y) pair falls 
on line 


Step 4 — Develop the Model (Cont'd) 
Straight Line Model 


= Most common regression model in remuneration — A straight line model is the most 
common mathematical model used in remuneration. 


m@ y=a+tbx-— This is the general formula for the straight line. 


™ Each combination of intercept and slope gives a uniquely different straight line — 
Each combination of intercept and slope (a and b) represents a uniquely different 
straight line. 


m For perfect models, every (x, y) pair falls on line — For a unique straight line, every 
(x, y) pair that satisfies the equation must fall on the line when graphed. 
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STEPS FOR DEVELOPING A REGRESSION MODEL —- STEP 4 


Step 4 —- 
Develop the Model (Cont'd) 
The Intercept 


Example: y = 10 + 5x 


@ What is the intercept? 


— The intercept is the point on the y-axis 
where the line crosses the y-axis 
corresponding to an x-value of zero. 


Step 4 — Develop the Model (Cont'd) 
The Intercept 
Example: y = 10 + 5x 


m All linear, two-variable models have four components: y, a, b, and x. The letter “a” 
represents the intercept for the line of best fit. 


GR2 # 9.19 


© WorldatWork. All rights reserved. 


STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 4 


Step 4 — Develop the Model (Cont'd) 
The Intercept 


Example: y = 10 + 5x 


Step 4 — Develop the Model (Cont'd) 
The Intercept 
Example: y = 10 + 5x 


m™ Ascanbe seen in the graph, the straight line that passes through these two points 
crosses the y-axis at 10, corresponding to an x-value of 0. 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 4 


Step 4 - 
Develop the Model (Cont'd) 
The Slope 
Example: y = 10 + 5x 
m What is the slope? 
— The slope is the “tilt” of the line. 


— Itis a ratio of the change in the y-values 
relative to the change in the x-values. 


— For straight line models, this ratio is a 
constant value. 


Step 4 — Develop the Model (Cont'd) 
The Slope 
Example: y = 10 + 5x 


@ All linear, two-variable models have four components: y, a, b, and x. The letter “b” 
represents the slope for the line of best fit. 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 4 


Step 4 — Develop the Model (Cont'd) 


The Slope 
Example: y = 10 + 5x 


Step 4 — Develop the Model (Cont'd) 
The Slope 
Example: y = 10 + 5x 


= All linear, two-variable models have four components: y, a, b, and x. The letter “b” 
represents the slope of the line of best fit. 


™ To determine the slope of the line between these two points, use the formula on the 
next page. 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 4 


Step 4 —- 
Develop the Model (Cont'd) 


Example: y = 10 + 5x 


| Slope: Changeofy's  Y2 —¥1 
| Change of x's Xx,-X, 


| Slope; 29-15 - 15 _ 5 
4-1 3 


Step 4 — Develop the Model (Cont'd) 
m™ Inthis example, the slope of the line is 5. 


m™ The slope of 5 means that for every increase of 1 unit on the x-variable, there is a 
corresponding increase of 5 units on the y-variable. 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 4 


Step 4 —- 
Develop the Model (Cont'd) 
Example: y = 10 + 5x 


30 


Step 4 — Develop the Model (Cont'd) 


How many points fall on the line? 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 4 


Step 4- 
Develop the Model (Cont'd) 
Example: y = 18 — 2x 


» 20 + Point 1 (x,, y;) 
co 
me) 
15 
7) 
° 
5 10 
E 
7 ° Point 2 (x,, y.) 
aay + 4 + 
0123 4 5 6 7 8 9 10 
X = pay grade 
Step 4 — Develop the Model (Cont'd) 
Example: y = 18 — 2x 
Point1: x, =0,y, =18 
Point2: x, =9,y, =0 
Slope : Yo Fi O18 S18 9 
X,-X, 9-0 9 
OR 
Slope : Woe Ve 2 1807 18.2 
X,-X, O-9 -9 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 4 


Step 4- 
Develop the Model (Cont'd) 


Determining Salary 


y = 7,000 + 40x 
relates a company’s salary to evaluation points 


Where: y = Annual salary 
x = Evaluation points 


What annual salary corresponds to 
600 evaluation points? 


7,000 + (40 x 600) 
7,000 + 24,000 
31,000 


Solve for y: y 


Step 4 — Develop the Model (Cont'd) 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 4 


Step 4 —- 
Develop the Model (Cont'd) 


Determining Salary 
Example: y = 7,000 + 40x (solving for x, given y) 


What evaluation If y = 7,000 + 40x 
point vale and if y = 39,000 
corresponds 
to a 39,000 then 39,000 = 7,000 + 40x 
salary? 39,000 — 7,000 = 40x 
32,000 = 40x 
_ 32,000 
40 
x = 800 


Step 4 - Develop the Model (Cont'd) 


Graph the equation on the next page. 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 4 


Step 4 —- 
Develop the Model (Cont'd) 


Locating the Point 
Example: y = 7,000 + 40x 


Annual Midpoint Dollars 
(thousands) 


200 400 600 800 1000 


Evaluation Points 


Step 4 -— Develop the Model (Cont'd) 


What is the ratio that defines the slope? 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 4 


Step 4- 
Develop the Model (Cont'd) 


Determining Salary 


= Determine the straight line 
that best fits data 


= Method of least squares 


Step 4 — Develop the Model (Cont'd) 


Determining Salary 


= Determine the straight line that best fits data — We have the data points and we 
know there is a linear relationship, now we need to find the straight line that best fits 


the data we have. 


= Method of least squares — We will use the method of least squares to find the line of 
best fit for our data. 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 4 


Step 4 —- 
Develop the Model (Cont'd) 
Method of Least Squares 


Minimises the sum of squares of these 
deviations (vertical distances} and*) 


Step 4 -— Develop the Model (Cont'd) 


Linear regression is the process of fitting the best straight line through the data we have. 
To do this, we use the method of least squares. This method develops the “best fit” line, 
where the sum of the squared vertical distances between each point and the line is the 
lowest possible. It positions the best fit line between the data points to minimise the sum 
of the squares of the vertical distances between each data point and the line. 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 5 


Step 5 —- 
Evaluate the Model 
m@ Is the model any good? 
= How strong is the relationship? 


= How much of the y-variable 
can be accounted for from a 
knowledge of the x-variable(s)? 


Step 5 — Evaluate the Model 


Evaluate the strength of the linear regression model. Interpret the results (coefficients, 
equation, r-squared). 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 5 


Step 5 - 
Evaluate the Model (Cont'd) 
Tools for Evaluating the Model 

@ r (correlation) 


H Coefficient of determination 
_ r2 
— RSQ 


= Common sense 


Step 5 — Evaluate the Model (Cont'd) 


Tools for Evaluating the Model 


@ r (correlation) 


Calculate the correlation coefficient 


How well do the x and y variables go together? 


= Coefficient of determination (r2or RSQ) 


Convert the r-value to an r2 value (i.e., RSQ) and obtain the coefficient of 
determination. 


The RSQ represents the percentage of the total variation in the y-variable that 
can be attributed to or explained by the relationship with the x-variable(s). 


This value provides a practical interpretation of the relationship. 


It helps us determine the strength of our model. 


= Common sense 


Do the results and interpretations make sense? 
Are they consistent with our expectations? 


Are we using representative data and an adequate amount of data to make sound 
conclusions? 
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STEPS FOR DEVELOPING A REGRESSION MODEL - STEP 5 


Step 5 —- 
Evaluate the Model (Cont'd) 
Simple Interpretation of Correlation (r) 


=@ Between —1.0 and +1.0 


@ If r=+1.0, al// the points fallina 
straight line with a positive slope 


m@ If r= —1.0, a// the points fall ina 
straight line with a negative slope 


Step 5 — Evaluate the Model (Cont'd) 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 5 


Step 5 —- 
Evaluate the Model (Cont'd) 


Simple Interpretation of Correlation (r) 


= The closer to +1 or —1, the 
stronger the relationship 


m@ The closer to 0, the weaker 
the relationship 


Step 5 — Evaluate the Model (Cont'd) 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 5 


Step 5 —- 
Evaluate the Model (Cont'd) 


Simple Interpretation of Correlation (r) 


Weaker r= 0.00 Weaker 
negative , positive 
No 
correlation 
r=0.00 


Step 5 — Evaluate the Model (Cont'd) 


The range of correlation coefficients can be visualised by this illustration. 


A zero correlation indicates that the two variables are unrelated. 
A correlation of +1.00 indicates the two variables are perfectly positively correlated. 
A correlation of —1.00 indicates the two variables are perfectly negatively correlated. 


The further away a correlation is from 0, the stronger the correlation. 
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STEPS FOR DEVELOPING A REGRESSION MODEL —- STEP 5 


Step 5 —- 
Evaluate the Model (Cont'd) 


Correlation 
Positive Negative No Error 
correlation correlation correlation message 


M<lececcecce 


Step 5 — Evaluate the Model (Cont'd) 


Correlation 


The sign of the correlation — positive or negative — is the same as the sign of the slope of 
the least-squares line. 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 5 


Step 5 —- 
Evaluate the Model (Cont'd) 


Significance 


@ Statistical significance 
and value 


@ Practical value 
m@ r? (RSQ) 


Step 5 — Evaluate the Model (Cont'd) 


Significance 


= Statistical significance and value 


If a correlation coefficient is not “statistically significant,” it has little or no value. 


Even if a correlation is “statistically significant,” it might not have any practical 
significance or value. 


= Practical value 


Practical value goes beyond statistical significance. 


Practical value allows us to draw meaningful conclusions from our findings. 


m r2 (RSQ) 


r2 (RSQ) represents the percentage of the total variation in the y-variable that can 
be attributed to or explained by the relationship with the x-variable. 


r2 (RSQ) is a tool that provides a practical level for interpretation purposes. 


r2 (RSQ) is also a tool that can be interpreted and understood by users without 
requiring high levels of statistical sophistication. 
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STEPS FOR DEVELOPING A REGRESSION MODEL - STEP 5 


Step 5 — 
Evaluate the Model (Cont'd) 


Determining Whether a Correlation 
Is “Statistically Significant” 


@ A correlation coefficient should reflect a “statistically 
significant” relationship. 


@ Anything can happen by “chance.” 


m™ To determine whether the correlation is statistically 
significant, r-values are usually converted to t-values. 


m™@ These t-values are then compared to benchmark 
t-values in a t-table to determine whether the 
relationship is statistically significant. 


Step 5 — Evaluate the Model (Cont'd) 


Many applications of correlation coefficients include a test to determine whether or not the 
correlation coefficient is “statistically significant.” Such tests are performed by converting 
the correlation coefficient to a t-value and then comparing the t-value to some benchmark 
value in a standardised t-table. 


Most software tools automatically perform this test and report its value in the output, 
including Excel, SPSS, SAS, Minitab, and other statistically oriented software. We will not 
perform a true statistical test here; however, you should be aware that such tests are 
commonly used in practice. 
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STEPS FOR DEVELOPING A REGRESSION MODEL — STEP 6 


Step 6 — 
Take Appropriate Action 
@ Apply the model. 
= Improve decision-making. 
m@ Make better predictions. 


@ Solve problems. 


Step 6 — Take Appropriate Action 
m@ Apply the model. 

m@ Improve decision-making. 

m@ Make better predictions. 
= 


Solve problems. 


Now that we have learned the six steps 


of developing a regression model, 
we will review a specific example. 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 1 


Example — Developing a 
Pay Model for Machinists 


1. Specify Issue/Problem 


Design a pay model to determine 
appropriate pay rates for machinists. 


@ Pay is the variable of interest. 
@ Pay is the dependent variable. 
@ Pay is the y-variable. 


@ Pay is the criterion variable. 


Example — Developing a Pay Model for Machinists 


Step 1. Specify issue/problem. 


In this example, we are going to design a pay structure to use in determining the 
appropriate pay rates for machinists. 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 2 


Pay Model for Machinists 


2. Generate Critical Factors 


Differences 
in 
Performance 
Levels 


Differences 
in Employers 
& Industries 


Differences A 
Differences 


ot Work ia, Pay Rate in Levels of 
Experience ie Education 


Machinists 


Differences 
in Labour 
Market 
Conditions 


Differences 
in Content 
of Work 


Differences 
in 
Machines 


Pay Model for Machinists 


Step 2. Generate critical factors. 


This is where you sit back and think of all the relevant critical factors that could be 
impacting your problem or issue. These become the factors that we may use to explain or 
predict the dependent or y-variable (the issue or problem). These critical factors become 
the potential x-variables or independent variables that may impact the problem or issue. 


For this example, we want to design a pay structure to use in determining pay rates for 
machinists. There are many critical factors that could impact machinist pay. We have listed 


several. 


Which might be the best critical factor 


to use in our model? 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 2 


Pay Model for Machinists 


2. Generate Critical Factors (Cont'd) 


™@ Many potential critical factors 


™@ Bivariate regression 


™ Multiple regression model 


Pay Model for Machinists 


Step 2. Generate critical factors. (Cont'd) 


Many potential critical factors — As illustrated, many variables may impact pay. 


Bivariate regression — We will limit ourselves to bivariate regression (also referred to 
as simple regression) at this point. We will discuss multiple regression later. 


Multiple regression model — If we used all these variables, we would have a multiple 
regression model. 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 2 


Pay Model for Machinists 


2. Generate Critical Factors (Cont'd) 


Differences 
in 
Performance 
Levels 


Differences 


in Employers 
& Industries 


Differences 
in Years 


Differences 
in Levels of 
Education 


Pay Rate 
for 
Machinists 
(y) 


Differences 
in Labour 
Market 
Conditions 


Differences 
in Content 
of Work 


Differences 
in 
Machines 


Pay Model for Machinists 

Step 2. Generate critical factors. (Cont'd) 

We have arbitrarily decided to use experience to develop our pay model. 
m™ Experience is the critical factor in the model. 

m Experience is independent variable. 

m Experience is the x-variable. 
= 


Experience is the predictor variable. 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 3 


Pay Model for Machinists 


3. Collect Data and Determine 
the Mathematical Relationships 


Pay Model for Machinists 


Step 3. Collect data and determine the mathematical relationships. 


You need to know if there are any mathematical relationships between the critical factors 
and the issue/problem. 


™ Collect the data. 

@ Plot the data. 

@ Determine the relationship. 

Example: 

We have collected data on years of experience and hourly pay for 10 machinists. 
xX = years of experience (independent variable) 

y = hourly rate (dependent variable) 

n = 10 

Mean of x = 9.500 

Mean of y = 10.520 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 3 


Pay Model for Machinists 


3. Collect Data and Determine the 
Mathematical Relationships (Cont'd) 


Scatter Diagram 


16.00 
14.00 +4 1 } 
12.00 } ar eee = 
2 40.00 — 8 

8.00 
6.00 | 
4.00 | 
2.00 
0.00 


Pay 
eo 


Machinists 


Pay Model for Machinists 


Step 3. Collect data and determine the mathematical relationships. (Cont'd) 


We have plotted the data we collected on the machinists and their years of experience. 


What is the conclusion about the 


mathematical relationship between the 
critical factors and the issue/problem? 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 3 


Pay Model for Machinists 


3. Collect Data and Determine the 
Mathematical Relationships (Cont'd) 


Salary 
Experience 


Linear Relationship (straight line) 
y =a + bx 


Salary 
Experience 


Nonlinear Relationship 
y =a + bx + cx? 


Multiple-Factor Linear Relationship (more than two variables) 


y =a + b4x, + Bx, + b3x3 y = Salary 
x, = Experience 
X, = Education 
x, = Performance 


From the plotted points, which model 
appears to best capture the data? 


Pay Model for Machinists 


Step 3. Collect data and determine the mathematical relationships. (Cont'd) 


We have examined the possible relationship between our issue/problem and the 
critical factor graphically, now let’s determine the appropriate algebraic representation 
of the relationship. 


= Linear relationship — A situation in which the relationship between an x-variable and 
a y-variable can best be described by a straight line. The data points plotted on a graph 
of this relationship follow a straight line pattern. 


= Nonlinear relationship — A situation in which the relationship between an x-variable 
and a y-variable cannot be best described by a straight line. The data points plotted on a 
graph of this relationship will appear curvilinear. Examples of nonlinear models include 
parabolas, exponential models, log models, power functions, and other such models. 


= Multiple-factor linear relationship — A statistical technique of creating a model of a 
y-variable as a function of more than one x-variable. It allows an assessment of the joint 
impact of several x-variables on the y-variable. It is used when you want to explain, 
predict, or model the y-variable using more than one x-variable. 


Based upon what we have learned already, 


which formula should be used for our data set? 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 4 


Pay Model for Machinists 


4. Develop the Model, Quantify 
and Interpret the Relationship 


To determine a straight line, what do we need to know? 


Straight Line Model 
y =a+bx 


Determine the values of a and b 
for the least squares line y = a + bx 
Slope 


> - 2k-x)y-y 
x(x - x)” 


Intercept 
a= y-—bx 


Pay Model for Machinists 


Step 4. Develop the model, quantify and interpret the relationship. 


The slope and intercept of the line of best fit through our data points are based on the 
method of least squares. 


They are mathematically determined by an objective function that minimises the overall 
distances from all the data points to the line of best fit. 


The equation for the slope cannot have zero in the denominator. That is, there must be 
variation in the x-variable. The values cannot all equal the mean for the x-variable. 


We will NOT have to manually use these equations, as they are standard features in 
scientific calculators, computer spreadsheets and other computer software. 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 4 


Develop and Quantify the Model 


Linear Regression: Calculating a and b 


PROCEDURE: 


As 
2: 


Fix to 3 decimal places. 


Select data entry portion 
of statistics worksheet. 


Clear worksheet. 


Enter data. 


a) 


d 


~~ 


e) 


Enter 1st x-data value. 


Move down 1 display. 


Enter 1st y-data value. 


Move down 1 display. 


Enter 2"9 x-data value. 


Move down 1 display. 


Enter 2" y-data value. 


Move down 1 display. 


Enter 3 x-data value. 


Move down 1 display. 


Enter 3'¢ y-data value. 


Move down 1 display. 


Enter 4‘ x-data value. 


Move down 1 display. 


Enter 4th y-data value. 


Move down 1 display. 


Enter 5th x-data value. 


Move down 1 display. 


Enter 5 y-data value. 


Continued ... 


DISPLAY: 

2nd Format DEC =(Old contents) 
| 3 | [enter] [DEC = 3.000 | 
[x01 = (Old Contents) | 


PRESS: 


2nd Clr Work X01 = 0.000 
8 Enter | [X01 = 8.000 | 

Lv] Y01 = 1.000 

v X02 = 0.000 

5 Enter X02 = 5.000 

7 Y02 = 1.000 


Y03 = 11.900 


v X04 = 0.000 

13 Enter X04 = 13.000 

V Y04 = 1.000 
11.2 Enter Y04= 11.200 
X05 = 0.000 


| 11 Enter | Y05 = 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 4 


Develop and Quantify the Model 
Linear Regression: Calculating a and b 


Continue entering data: 


PROCEDURE: 


f) 


~— 


g 


h) 


j) 


Move down 1 display. 
Enter 6" x-data value. 
Move down 1 display. 
Enter 6" y-data value. 
Move down 1 display. 
Enter 7 x-data value. 
Move down 1 display. 
Enter 7‘ y-data value. 
Move down 1 display. 
Enter 8 x-data value. 
Move down 1 display. 
Enter 8" y-data value. 
Move down 1 display. 
Enter 9' x-data value. 
Move down 1 display. 
Enter 9* y-data value. 
Move down 1 display. 
Enter 10* x-data value. 
Move down 1 display. 


Enter 10‘ y-data value. 


PRESS: DISPLAY: 
v X06 = 0.000 
5 Enter X06 = 5.000 
v Y06 = 1.000 
9.4 Enter  YO6= 9.400 
X07 = 0.000 
18 | [| Enter | | x07 = 18.000 
Y07 = 4.000 
[ 13.4 | [ Enter] [Yo7 = 13.400 
v X08 = 0.000 
15 Enter  X08= 15.000 
v Y08 = 1.000 
125 Enter YO8= 12.500 
[ x09 = 0.000 
ws _| [Enter] [x09 = 2.000 
[ Yoo = 1.000 | 


a [ Enter ] | Yoo = 7.300 


8 Enter 
& 
10.5 Enter 
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X10 = 0.000 
X10 = 8.000 
Y10 = 1.000 
Y10 = 10.500 


EXAMPLE — DEVELOPING A PAY MODEL: STEP 4 


Develop and Quantify the Model 


Linear Regression: Calculating a and b 


2nd Stat 1-V 
(or whatever model used previously) 


2nd) | SET | LIN 


2nd SET Ln 
| 2nd SET —s| EXP 
2nd SET PWR 


You must be in the “Stat” menu before you can change models. 
Once there, you can change models by pressing 2nd Function, 
then Set, to get to the next model. 


If you were previously in the 1-V model, you will only need to 
press 2nd Function, and then Set. 


Stop when you arrive at LIN, which represents the “linear model.” 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 4 


Develop and Quantify the Model (Cont'd) 


Linear Regression: Calculating a and b 


v n=10 
(v | | ete. | . X = 9.500 

+ etc. 

vo | 

S a= 7.033 

Vv b = 0.367 | 

b r= 0.963 
vy sete. 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 4 


Pay Model for Machinists 


4. Develop the Model, Quantify and 
Interpret the Relationship (Cont'd) 


Predicting Salary 


Answer: a = 7.033 
b = 0.367 


Least squares 
equation y =a+bx 


for this linear ’ = 7.033 + 0.367x 
regression model: 


< 
II 


Pay Model for Machinists 


Step 4. Develop the model, quantify and interpret the relationship. (Cont'd) 


Predicting Salary 


m The solution is we pay machinists a basic rate of 7.033 per hour for no experience. 
They receive 0.367 per hour more for each additional year of experience. 


m Why do we use y’? 


— |trepresents the predicted value. The original data had x and y pairs of data. 
These are the original y values. Any y-value that falls on the line is a predicted 
value of y. 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 4 


Pay Model for Machinists 


4. Develop the Model, Quantify and 
Interpret the Relationship (Cont'd) 


Predicting Salary: Line of Best Fit 


1. Calculate any two points based upon 
the regression equation. 
Plot the two points. 


Draw the line through these two points. 


Pay Model for Machinists 


Step 4. Develop the model, quantify and interpret the relationship. (Cont'd) 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 4 


Pay Model for Machinists 


4. Develop the Model, Quantify and 
Interpret the Relationship (Cont'd) 


First, let x =0 Next, let x = 10 
and solve for y’ and solve for y’ 
Point 1 Point 2 
x=Oy —7 x =10,y’=? 
y’ = 7.033 + (0.367)(0) = y’ = 7.033 + (0.367 )(10) 
= 7.033 = 7.033 + 3.67 
= 10.703 
‘ i 
(0, 7.033) (10, 10.703) 


Pay Model for Machinists 


Step 4. Develop the model, quantify and interpret the relationship. (Cont'd) 
Predicting Salary 
Which two points to pick? 


We'll solve for two points on the line: (y = a + bx). 


Point 1 Point 2 
y a ae 4 Y= 10;y°=:7 
(Intercept always on the line) y’ = 7.033 + (0.367)(10) 


y’ = 7.033 + (0.367)(0) 7.033 + 3.67 
7.033 10.703 


Therefore, Point 1 corresponds Therefore, Point 2 corresponds 
tox = Oandy’ = 7.033 tox = 10 andy’ = 10.703 


a} L 
(0, 7.033) (10, 10.703) 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 4 


Pay Model for Machinists 


4. Develop the Model, Quantify and 
Interpret the Relationship (Cont'd) 


Line drawn through (0, 7.033) and (10, 10.703) 


Regression Model 


Pay Model for Machinists 


Step 4. Develop the model, quantify and interpret the relationship. (Cont'd) 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 4 


Pay Model for Machinists 


4. Develop the Model, Quantify and 
Interpret the Relationship (Cont'd) 


Line drawn through (0, 7.033) and (9.500, 10.520) 


Regression Model This tim e, the 

. hy -] vee al “second point” 

on | | ; *— _—sccorresponds to 
ie | I : : 
£4900 | | __ the intersection 
2 8.00 ae | | i of the two means. 
§ 6.00 + | _} — : . : 
= 400 _(X, y) Grand Mean | | This point will 

2.00 | | always fall on 

ee eee ~=Cétihe line. 


0 2 4 6 8 10 12 44 16 18 20 


Experience 


Pay Model for Machinists 


Step 4. Develop the model, quantify and interpret the relationship. (Cont'd) 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 4 


Pay Model for Machinists 


4. Develop the Model, Quantify and 
Interpret the Relationship (Cont'd) 


To use the line for prediction, 
substitute for x and calculate y’. 


Example: ; 

What is the predicted YY = 7.033 + 0.367x 
salary fora machinist * = 11 

with 11 years of ’ = 7.033 + (0.367 )(11) 
experience? = 11.07/hr. 


Pay Model for Machinists 


Step 4. Develop the model, quantify and interpret the relationship. (Cont'd) 
Predicting Salary 

™ Let's predict what the salary would be for a machinist with 11 years of experience. 
m What variable represents years of experience? 


—  |tis the x-variable, the independent variable, so we substitute 11 for x and solve 
for the predicted value of y. 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 4 


Pay Model for Machinists 


4. Develop the Model, Quantify and 
Interpret the Relationship (Cont'd) 


Data: Use data from the previous calculation. 
Consider 11 years of experience. 


PROCEDURE: PRESS: DISPLAY: 

1. Move display to x’ value. a x’= 0.000 
2. Enter x’ value (11). 11 Enter x’= 11.000 
3. Move display to y’ value. y y’= 0.000 
4. Compute y’ value. CPT y’= 11.071 


DO NOT CLEAR CALCULATOR 


Pay Model for Machinists 


Step 4. Develop the model, quantify and interpret the relationship. (Cont'd) 
Predicting Salary 


Calculate predicted salary using the TI BAII Plus calculator. 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 4 


Pay Model for Machinists 


4. Develop the Model, Quantify and 
Interpret the Relationship (Cont'd) 


Regression Model 


1400 |___x = 11,y = 11.90 | 


», 12.00 — ry 


8.00 1 1 1 
If x = 11, y’ = 11.07 


Machinists’ Pa: 
a 
o 
° 


Experience 


If a person has 11 
years of experience, 
we would predict 
their pay to be 11.07 
per hour, based on 
the linear model. 


The actual pay for a 
person with 11 years 
of experience is 
11.90 per hour. 


The model under- 
estimated by 0.83 
per hour in this case. 


Pay Model for Machinists 


Step 4. Develop the model, quantify and interpret the relationship. (Cont'd) 


Here are the two points plotted. 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 5 


H@ How is r calculated? = [(x lie xy . y)] 


@ What is the easy way to find r ? 
What is r for this example? 


What does r tell us? 


Pay Model for Machinists 
5. Evaluate the Model 
Correlation Coefficient 


Use the STAT menu 
r= .963 


The strength and direction of the relationship 


Pay Model for Machinists 


Step 5. Evaluate the model. 


Correlation Coefficient 


The correlation coefficient provides a measure of the strength of the relationship 
between the x-variable and the y-variable. 


It is based on a mathematical equation. 


We will not use the equation in this seminar. Instead, we will take advantage of the 
built-in equations within the Stat menu of the calculator. 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 5 


Pay Model for Machinists 
5. Evaluate the Model 


Coefficient of Determination 


= What does r? tell us? 


Example: r2__ = (.963) 2 


.926, rounded to .93 


Pay Model for Machinists 


Step 5. Evaluate the model. (Cont'd) 


Coefficient of Determination 


m@ What does r? tell us? 


The coefficient of determination provides a practical interpretation of the 
relationship between the variables. 


It tells us the percent of the total variation in the y-variable that can be attributed 
to the relationship with the x-variable. 


If r2 = .93, we can conclude that 93% of the variation in machinists’ pay can be 
accounted for by a knowledge of the relationship between pay and experience. 


The remaining 7% may be attributed to other factors not yet included in the model 
or may be attributed to chance. 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 5 


Pay Model for Machinists 
5. Evaluate the Model (Cont'd) 
Calculate 


PROCEDURE: PRESS: DISPLAY: 


Computer? |_x? | 


Pay Model for Machinists 


Step 5. Evaluate the model. (Cont'd) 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 5 


Pay Model for Machinists 


5. Evaluate the Model (Cont'd) 


When is r big enough? 
™ Depends on situation 
™ Statistical significance vs. practical value 
= Comparing r2 values 


Case1 | Case 2 
r =0.80 | r =0.40 


r2 = 0.64 r2= 0.16 


Pay Model for Machinists 


Step 5. Evaluate the model. (Cont'd) 
When is r big enough? 
m@ Depends on situation 


In Case 1, when we square 0.80 to obtain RSQ = 0.64, we can explain 64% of the 
variation in y by knowing x. In Case 2, when we square 0.40 to obtain RSQ = 0.16, we 
can only explain 16% of the variation in y by knowing x. Therefore, we can explain four 
times as much about the y-variable in Case 1 by knowing the x-variable than we can in 
Case 2. 

= Statistical significance vs. practical value — Even if r is statistically significant, it may 
not have much practical value. 


= Compare r2 values — Comparing r2 values provides much more practical and readily 
understandable information. 
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EXAMPLE — DEVELOPING A PAY MODEL: STEP 6 


Pay Model for Machinists 
6. Take Appropriate Action 


@ RSQ has practical value 


— 93% of pay can be accounted for 
by experience 


m@ Make recommendations 
= Implement pay model 


@ Monitor pay model in the future 


Pay Model for Machinists 
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Cautions of Correlations 


Caution No. 1 


Not cause-and-effect 


Caution No. 2 


Not all relationships 
are linear 


Nonlinear Relationship 


Cautions of Correlations 


= Not cause-and-effect — Correlation does not necessarily imply cause-and-effect 
relationships. 


= Notall relationships are linear — The correlation coefficient obtained from a linear 
model only measures the strength of a linear relationship. 
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Cautions of Correlations (Cont'd) 


- Caution No. 3 —— ) 


Respect range of data 


-_ Caution No. 4 


Ensure adequate sample size 


Cautions of Correlations (Cont'd) 


= Respect range of data — Be cautious when using the model to predict values that fall 
well beyond the range of data that were used to develop the model. 


m™ Ensure adequate sample size — The sample size must be adequate to have a high 
comfort level with the conclusions. A common rule of thumb is to have at least 10 data 
points for every predictor variable, plus “10 for the pot.” Therefore, if we are trying to 
predict the y-variable from the knowledge of one x-variable, we should have at least 
10 + 10 = 20 cases. 


Source: Beatty, James R., Statistical Methods, 5" edition, McGraw Hill Publishers, 1997. 
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Multiple Regression Model 
y=a+b,x, + b,x, + b3x3...+ b,x, 


y = pay 

X, = experience 

XxX, = performance level 
X3; = education 


a » intercept 
X4,X2,X3 % Critical factors 
b,,b,,b, » slopes (i.e., regression weights) 
of critical factors 


Multiple Regression Model 
™ Notice the model again has an intercept. 
™ It also has a slope (i.e., a regression weight) associated with each predictor variable. 


= For this model, we should have approximately 40 or more data points. 


Use multiple regression when the variation in the 


dependent variable cannot be “explained” or predicted 
to the accuracy desired in terms of one variable. 
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MULTIPLE REGRESSION 


Applications of 
Multiple Regression Models 
m@ Establishing job worth 


m@ Examining how supervisors make 
decisions 


™@ Projecting funding needs 
@ Evaluating budget increase trends 


@ Modelling decisions 


Applications of Multiple Regression Models 


Establishing job worth — establishing job worth as a function of a number of 
compensable factors 


Examining how supervisors make decisions — examining how supervisors make 
employee performance decisions from numerous input variables 


Projecting funding needs — projecting pension-funding needs from economic 
conditions, variation in interest rates, retirement patterns, and other factors 


Evaluating budget increase trends — evaluating salary budget increase trends from 
historical time data, political factors, labour negotiation data, and economic fluctuations 


Modelling decisions — modelling executive remuneration decisions based on 
organisational financial ratios, employee data, and personal characteristics 
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MULTIPLE REGRESSION 


Example — Multiple Regression in Remuneration 


Percent Pay 


Increases 1 5 0.9 
2 5 1.0 

for Employees 3 5 1.0 
at DMD 4 5 1.1 
5 4 0.9 

6 4 0.9 

7 4 0.9 

8 4 1.0 

9 4 1.0 

10 4 1.0 

11 4 1.0 

12 4 1.1 

13 3 0.9 

14 3 0.9 

15 3 1.0 

16 3 1.0 

17. 3 1.0 

18 3 1.1 

19 2 0.9 

20 2 1.0 


Example — Multiple Regression in Remuneration 


We could try to predict percent pay increase from a knowledge of job performance. 
We could try to predict percent pay increase from a knowledge of compa-ratio. 


However, we might be able to better understand percent pay increase froma 
knowledge of both job performance and pay simultaneously. 


To do so, we will use a multiple regression model. 
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MULTIPLE REGRESSION 


Simple Linear Regression Prior to 
Examining Multiple Regression 


Bivariate Model 1: Percent Pay Increase as a Function 
of Performance Only (2-Dimensional Scatterplot) 


Pay = Function (Performance) 


3 
= 


y’ = -1.7809 + (1.9475)x 
r = 0.8326 | 
r? = 0.6932 


Percent Pay Increase 
os nD o > a ao N“ o wo 


o 


Performance 


Simple Linear Regression Prior to Examining Multiple Regression 
™ Bivariate Model 1 plots percent pay increase vs. performance. 
m™ The correlation coefficient is r = 0.8326. 


™ The coefficient of determination is r2 = 0.6932. 
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MULTIPLE REGRESSION 


Simple Linear Regression Prior to 
Examining Multiple Regression (Cont'd) 


Bivariate Model 2: Percent Pay Increase as a Function 
of Compa-Ratio Only (2-Dimensional Scatterplot) 


Pay = Function (Compa-Ratio) 


a = — = = 
9 | : ! | y’ = 16.9293 — (11.7391)x 
| t r = -—0.3782 
s | * r? = 0.1430 
= 5 = 
| 
23 e ® 
2 | 
1 
07 ' + 
0.8 0.85 0.9 0.95 1 1.05 1.1 1.15 
Compa-Ratio 


Simple Linear Regression Prior to Examining Multiple Regression (Cont'd) 


Bivariate Model 2 plots percent pay increase vs. compa-ratio.. 
The correlation coefficient is r = —0.3782. 
The coefficient of determination is r? = 0.1430. 


Notice that the tilt of the line is negative; thus, the slope of the line is negative, as is the 
correlation coefficient. 
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MULTIPLE REGRESSION 


Visualizing Multiple Linear Regression 


Multiple Model 3: Percent Pay Increase 
as a Function of Both Performance and Compa-Ratio 
(3-Dimensional Scatterplot) 


- 


Pay Inérease 


a 
e 
o 
s 
p 
= 


Visualizing Multiple Linear Regression 


™ Notice that when we have two predictor variables (i.e., two x-variables) and one 
y-variable, the relationship can no longer be depicted in a two-dimensional graph. 


m@ We now have to view the simultaneous relationship among the three variables in 
three-dimensional space. 
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MULTIPLE REGRESSION 


Multiple Linear Regression 


Model 3: Percent Pay Increase as a Function 
of Both Performance and Compa-Ratio 


y’ = 13.1428 + (2.1241 *x,) — (15.8951 *x,) 


where: 


, 


y = the percent pay increase 
13.1428 = the intercept of the multiple regression model 


2.1241 = the slope, regression weight, or coefficient 
of the first x-variable 


x, = the performance variable 


—15.8951 = the slope, regression weight, or coefficient 
of the second x-variable 


X, = the compa-ratio variable 


Multiple Linear Regression 


m In this example, there is one intercept and two slopes. Since s!opes are difficult to 
visualise in more than two-dimensional space, these slopes are often referred to as 
regression weights, regression coefficients, or raw weights. 


m™ We rely on computer software to calculate the intercept and the slopes in multiple 
regression. The task would be far too labour intensive to perform manually. 


™ However, obtaining these values using software is straightforward and is commonly 
done. 
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MULTIPLE REGRESSION 


Multiple Linear Regression — 
Conclusion 


Model 3: Percent Pay Increase as a Function 
of Both Performance and Compa-Ratio 


y’ = 13.1428 + (2.1241 *x,) — (15.8951 °x,) 
r = .9746 
.9497 


® 
H} 


Multiple Linear Regression — Conclusion 


Here, using two variables instead of one increases the RSQ to .95, resulting in 95% 
of the variation in pay being accounted for by a knowledge of performance and 
compa-ratio. 

When using only one variable, the strongest of the two bivariate models yielded an 
RSQ value of .69. Using performance alone did not account for as much of the 
variation in pay as when using both performance and compa-ratio simultaneously. 


Notice that the multiple regression RSQ of .95 is greater than the sum of the two 
separate RSQ values (i.e., .95 is greater than .69 + .14). 


In reality, as performance goes up, an employee’s compa-ratio often goes up as a 
natural progression in the pay system. These two predictor variables are not 
completely independent of each other. 


In reality, high-performing individuals with low compa-ratios are often granted higher 
percent pay increases than high-performing individuals with average or above-average 
compa-ratios. Therefore, the two variables tend to have an interaction effect. 


One last caution: Keep in mind that we should have had about 30 or more observations 
in this example, since we had two predictor variables. 
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Can You... 


= Identify applications of regression 
models in an HR environment? 


™ Develop a regression model? 


m Identify cautions in the 
interpretation of correlations? 


= Explain why one would use a 
multiple regression model and 
identify potential applications? 
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Module 9 Quiz 


1: 


What is the slope of the line between A and B in the graph below? 


What type of regression model does the following equation suggest? 
y=a+b,x, 
Multiple factor regression 


Nonlinear regression 


Bivariate regression 


a 


Curvilinear regression 


What type of relationship does the following equation model? 


y=a+ b,x, + box, 


. Nonlinear regression 
. Straight line regression 
. Multiple linear regression 


. All of the above 


0 0O WwW D> 
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Module 9 Quiz (Cont'd) 


4. What conclusion can be drawn from a = 
the graph at right? 35,000 


30,000 


A. Experience and annual pay 
have no relationship. 


ie) 
on 
So 
f=) 
Oo 


B. Annual pay decreases as 
experience increases. 


Annual Pay 
NO 
Oo 
r=) 
oS 
So 


C. There is a linear relationship 15,000 
between experience and 
annual pay. | 10,000 
D. Pay and experience have an 0 
inverse relationship. 1 2 3 4 5 
Experience 


5. What does a correlation of +1.0 indicate? 


A. A positive slope 
A negative slope 


A positive intercept 


oO Oo @ 


A negative intercept 


6. What type of relationship does this equation model? 
y =at+b,x, + b,x,” 

Nonlinear regression 

Straight line regression 


Bivariate regression 


All of the above 


Es E+ 
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Module 9 Quiz (Cont'd) 


7. You are asked to examine the relationship between job worth and market-based pay 
for similar jobs. You have defined the following regression model for examining this 
relationship: Market based pay = Intercept + (slope) x (job worth). You have collected 
data and have quantified your model as follows: y’ = 15,000 + 50x. The value of 
“15,000” in the model represents which of the following? 

A. The slope of the line 


B. The point where the line of best fit crosses the x-axis corresponding to a y-value 
of zero 


C. The mean for y’ 


D. The intercept for the line of best fit 


8. For the same problem set, what does the value “50” represent in the following 
equation? 


y’ = 15,000 + 50x 
A. The monetary market-based pay increase associated with every additional point 
value in the job worth scale 


B. The ratio of the change in y-values relative to the change in x-values on the line of 
best fit 


C. The slope of the line 
D. All of the above 


9. For the same problem set, you are now wondering what your organisation should pay 
for an entry-level production-manual editor, a job that has been determined to be worth 
500 job evaluation points in your organisation’s system. What would you estimate 
the market-based pay for that job should be, based solely on the linear model of 
y’ = 15,000 + 50x? 

A. 50,000 

B. 15,000 
C. 25,000 
D. 40,000 
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Module 9 Quiz (Cont'd) 


10. For the same problem set, you have determined through your salary survey that the 
market appears to be paying entry-level buyers 42,500. Approximately how many job 
evaluation points should your entry-level buyer position be worth to your organisation, 
assuming the regression model provides a good fit, based solely on the linear model of 
y’ = 15,000 + 50x? 

A. 50 

B. 15,000 

C. 500 

D. 550 
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(blank) 
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Module 4 
Case Study 1 


A large organization has field representatives based in a number of geographic 
locations. The HR Department has designed an incentive program that awards bonus 
points based on productivity. In Module 4, data were presented for a sample of 20 such 
representatives based in Madrid. Additional data have now been collected for a sample 
of 16 field representatives working out of the Rome office for this same organization. The 
following values represent the bonus points these Rome representatives earned over the 
same time period in which the Madrid data were collected: 


Rome Data: 18, 25, 23, 33, 31, 19, 26, 28, 42, 32, 27, 22, 7, 13, 46, and 3 


A. Address the following five basic questions about the Rome data: 


What is the variable of interest? 
What is the level of measurement for your variable of interest? 
Do these data represent a sample or a population? 


Are these data suitable for grouping into intervals (i.e., categories, groups, etc.)? 


OF ee: ss 


Are there any other variables available that can be used to better understand the 
variable of interest? 


B. Rank order the Rome data in a single column from high to low, with the 
biggest value placed at the top of the list. Now, what would a bar graph for 
Rome look like in raw form (without grouping the data)? 


C. Group the data into intervals. Before doing so, consider the following issues: 
1. What would be a reasonable interval width to use for Rome? 
m First find the difference between the highest value (46) and the lowest value 
(3), which is 43. Then divide 43 by both 20 and 5. These two divisions result 
in values of 2.15 and 8.6. Therefore, an appropriate interval width might be 
some value between 3 and 8. Reasonable interval widths might be 3, 4, 5, 6, 
7 or 8. 


m= Read on before making your final decision about the width of the intervals for 
Rome. 
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Module 4 
Case Study 1 (Cont'd) 


2. Do you intend to compare the Rome results with the Madrid results? 


In this scenario, you will probably be interested in comparing the bonus points 
earned in Madrid with the bonus points earned in Rome. 


Notice that the range for Madrid was 54, while the range for Rome was 43. 


Dividing 54 by 5 and by 20 will yield different results than dividing 43 by 5 and 
20. 


If you want to compare samples, you should use the same interval widths for 
the graphs of the two locations. Otherwise, you cannot meaningfully compare 
the two graphs. 


The range of the two locations combined as one group is 63 — 3 = 60. 
Dividing 60 by 5 and by 20 yields potential interval widths somewhere 
between 3 and 12. 


Your best options might now be 5 or 10. 


Arbitrarily decide to use widths of 10 for both locations so you can make 
comparisons. 


Begin at zero, and set the intervals at 0-9, 10-19, 20-29, etc. 


3. Now create a grouped frequency distribution table for Madrid and for Rome. 


D. Create bar graphs. Before doing so, address the following issues: 


1. Do the two sets of data contain the same number of sales representatives? 


In this scenario, the sample from Madrid consists of 20 sales representatives, 
while the sample from Rome consists of 16 sales representatives. 


When creating bar graphs for data in which there are an unequal number of 
observations in the different groups, relative frequency percents instead of 
absolute frequencies should be used on the y-axes. 


This will make any comparisons between the two graphs easier to interpret. 


2. Now, after considering all the above issues, create bar graphs for Madrid and 
Rome. 
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Module 6 
Case Study 2 


In Module 6, measures of central tendency and location were discussed. The Madrid 
example, based on a sample of 20 Madrid field representatives, was used to illustrate 
several of these concepts. As a review, here are the results of many of the measures of 
central tendency and location for Madrid: 


Madrid Data 


Results 
57.20 
45.50 
30.00 
21.50 
16.80 

Does not exist 

30.00 

33.35 


Statistical Measures Rank Order Position 


Mode 
Median 


Use the following data (also presented elsewhere) to determine these same measures of 
central tendency and location for Rome: 


Rome Data: 18, 25, 23, 33, 31, 19, 26, 28, 42, 32, 27, 22, 7, 13, 46, and 3 


Rome Data 


Statistical Measures 

Poo 
P75 
Pso 
Pos 
Pro 

Mode 

Median 
Mean 


Note: The answers given for the Madrid data above were obtained without any rounding. 
The values for the mean and the standard deviation were calculated first and then stored 
into memory cells (such as storing the mean in memory location 7 and storing the standard 
deviation in memory location 8). The subsequent answers for the z-scores, for 2-sigma 
boundaries, and for 3-sigma boundaries will agree with those derived from any computer 
software calculations. However, if the mean and standard deviation have been rounded (as 
they were above when creating the table), and the rounded values are used to perform 
subsequent calculations, answers may differ. The answers for this case study may be 
slightly different, at perhaps the second decimal place. 
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Module 6 
Case Study 2 (Cont'd) 


A percentile bar has already been completed for Madrid. Use it as a guideline and as an 
anchor to create a percentile bar for Rome just to the right of the Madrid percentile bar. 


Madrid 


What observations might you make when comparing Madrid to Rome? Use the 
measures of central tendency, the measures of location, and the percentile bars to help 
you make these observations. 
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Module 7 
Case Study 3 


In Module 7, measures of variability were discussed, as well as z-scores, the 2-sigma 
rule, and the 3-sigma rule. As a review, here are findings for the Madrid data, illustrating 
many of the concepts discussed in that module. These include measures of variability, 
z-scores, boundaries for the 2-sigma rule, boundaries for the 3-sigma rule, and a 
graphical example of the 2-sigma rule as applied to Madrid: 


Madrid 


Statistical Measures 


Range 54.00 
Interquartile Range 24.00 
Standard Deviation 15.84 
z-score for a raw score of 63 +1.87 
z-score for a raw score of 9 —1.54 
Lower boundary for the 2-sigma rule 1.68 
Upper boundary for the 2-sigma rule 65.02 
Lower boundary for the 3-sigma rule -14.16 
Upper boundary for the 3-sigma rule 80.86 


15.84 15.84 15.84 15.84 
1.68 17.51 33.35 49.19 65.02 
—2 SD -1 SD 0SD +1 SD +2 SD 
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Module 7 
Case Study 3 (Cont'd) 


Now use the Rome data to fill in the table below regarding various statistical values: 


Rome Data: 18, 25, 23, 33, 31, 19, 26, 28, 42, 32, 27, 22, 7, 13, 46, and 3 


Rome 


Statistical Measures Results 


Range 


Interquartile Range 


Standard Deviation 


z-score for a raw score of 46 


z-score for a raw score of 3 


Lower boundary for the 2-sigma rule 


Upper boundary for the 2-sigma rule 


Lower boundary for the 3-sigma rule 


What observations might you make when comparing Madrid to Rome? Base your 
conclusions on the measures of variability, the 2-sigma rule, and the 3-sigma rule. 


Does 46 bonus points in Madrid “statistically” appear to be an outlier? 
Does 3 bonus points in Madrid “statistically” appear to be an outlier? 
Does 46 bonus points in Rome “statistically” appear to be an outlier? 
Does 3 bonus points in Rome “statistically” appear to be an outlier? 


Why is the lower boundary for the 3-sigma rule a negative value for Rome? 


GR2 = CS.8 


© WorldatWork. All rights reserved. 


Relative Frequency % 


Module 8 
Case Study 4 


In Module 8, shapes of distributions were 
discussed. The module emphasized the Poo 57.20 
importance of determining whether a distribution ES 


was symmetric, skewed, flat, or perhaps normal. 


Terminology such as “skewed to the left,” P75 45.50 
“skewed to the right,” “negatively skewed” and 
“positively skewed” was discussed. The module Poo 37.50 
concluded with a discussion of the normal moan 239 P 31.25 
distribution. Peso ad i ; 
Sue PO BS 
Use the bar graphs and the percentile bars Pas a 21.50 p 48.75 
presented here, as well as other information Pro 16.80 a : 
to draw conclusions about the shapes of the é Sarno 
distributions for Madrid and Rome. a : 
Madrid Rome 
Madrid Data: Interval Widths of 10 
(Starting at the Origin) Rome Data: Interval Widths of 10 
Converted to Relative Frequency % (Starting at the Origin) 


50% 
40% 
30% 
20% 


10% 


Relative Frequency % 


0% 


45 145 245 345 445 545 645 


Bonus Points 


Madrid and Rome Data Combined 


10% 


Relative Frequency % 


0% 


45 145 245 35 445 545 645 


Bonus Points 
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245 345 445 545 645 


Bonus Points 


Module 9 
Case Study 5 


In Module 9, the concepts of correlation and regression were discussed. Hopefully, you 
have concluded that the sample of Madrid sales representatives earn more bonus points 
than the sample of Rome sales representatives, based on the measures of central 
tendency and location. Hopefully, you also have concluded that there is much more 
variation in bonus points for these Madrid representatives than for Rome representatives. 


After considering such conclusions, you should have begun to wonder “Why?” You could 
consider many possible factors that might be related to the variation in bonus points, 
including performance, experience, age, territory, motivation, the skills of the managers, 
training, and many other factors. Data have now been collected for each location 
regarding months of experience for these same sales representatives, as follows: 


Madrid Data Rome Data 


Sales Months Bonus Sales Months 
Person Experience Points Person Experience 
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Module 9 
Case Study 5 (Cont'd) 


A. Address the following questions for Madrid and Rome. Base your decisions on the 
relationship between experience and bonus points: 


Key Questions 


. ls there a relationship? 


. Is it perfect? 


3. Are there any outliers? 


4. If there is a relationship, is the 
relationship linear? 


[5 If it is linear, is it positive or negative? 


6. If it is nonlinear, what is the nonlinear 
pattern? 


7. Does the graph reflect what you 
| expected? 


B. Address the following questions for Madrid and Rome. Base your decisions on the 
relationship between experience and bonus points: 


Statistical Values Madrid 


Correlation coefficient 


Coefficient of determination 


Intercept 
ie p 


Slope 


1. Given that a new employee in Madrid has 120 months of experience as a sales 
representative, how many bonus points would you expect this person to earn, 
based on your regression model? 


2. Given that a Madrid sales representative has earned 60 bonus points, how many 
months of experience would you expect for this person, based on your 
regression model? 
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Module 9 
Case Study 5 (Cont'd) 


3. Given that a new employee in Rome has 120 months of experience as a sales 
representative, how many bonus points would you expect this person to earn, 
based on your regression model? 


4. Given that a sales representative in Rome has earned 40 bonus points, how 
many months of experience would you expect for this person, based on your 
regression model? 


5. Suppose you hire a new employee in Rome. The person has 25 years of 


experience as a sales representative prior to joining the organization. How many 
bonus points might you expect this person to receive over the same time period? 
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Module 4 
Case Study 1 Answers 


A. Address the following five basic questions about the Rome data: 


1. What is the variable of interest? 


Bonus points 


2. What is the level of measurement for your variable of interest? 


Ratio 


3. Do these data represent a sample or a population? 


Sample 


4. Are these data suitable for grouping into intervals (i.e., categories, groups, etc.)? 


Yes 


5. Are there any other variables available that can be used to better understand the 
variable of interest? 


No. At least we do not have any other variables at the present time. 


B. Rank order the Rome data in a single column from high to low, with the biggest 
value placed at the top of the list. Now, what would a bar graph for Rome look like in 
raw form (without grouping the data)? 


Rome Data 


Absolute Frequencies 


012345678 9INDNRBH HB 6 TM B 20 2122 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 4142 43 44 4546 


Bonus Points 
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Relative Frequency % 


Module 4 
Case Study 1 Answers (Cont'd) 


C. Group the data into intervals. These frequency distribution tables for Madrid and Rome 
are based on a number of arbitrary decisions, including using the same interval widths for 
both Madrid and Rome, by dividing the full range (60) of the data by 5 and by 20, by 
settling on interval widths of 10, and by beginning the bottom interval at zero. 


Madrid: Grouped Data 
Intervals with Constant Widths of 10 
Bottom Interval Begins at Zero 


NOTE: 


Rome: Grouped Data 
Intervals with Constant Widths of 10 
Bottom Interval Begins at Zero 


Dit [ul [wp [fi] 
eo | 69 | 645 [ 0 | 0% 
= 
[40 | 40 [445 [2 | 13% | 
Ml! 
[20 [20 [24.5 [6 | 38% | III 
Ml 
To [9 [45 [2 [13%] 
TOTAL 102% 


The values in the f% column for Rome were rounded to two decimal places. Therefore while it 


“appears” that this column sums to 102%, if each of these values is shown to four decimal places, it 
can be seen that the column actually does sum to 100%. 


D. Create bar graphs. Here are three examples. One bar graph is for Madrid, one is for Rome, 
and one is for Madrid and Rome combined. Notice they are based on “relative frequency 
percents,” since the two samples are not of equal size. 


Madrid Data: Interval Widths of 10, Starting 

at Zero, and Converting to Relative Frequency % 
50% 
40% 
30% 
20% 
10% 
0% 
545 


245 345 445 


Bonus Points 


45 145 


Madrid and Rome 


Relative Frequency % 


145 


645 


245 


1 


Relative Frequency % 


20%2% 


345 


50% 
40% 
30% 
20% 


Rome Data: Interval Widths of 10, Starting 
at Zero, and Converting to Relative Frequency % 


0% 
0% 


245 345 445 545 645 


Bonus Points 


45 145 


Combined 


445 545 645 


Bonus Points 
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Module 6 
Case Study 2 Answers 
In Module 6, measures of central tendency and location were discussed. The Madrid example, 
based on a sample of 20 Madrid field representatives, was used to illustrate several of these 


concepts. As a review, here are the results of many of the measures of central tendency and 
location for Madrid: 


Madrid Data 
Rank Order Position 


Results 


21.50 
16.80 
Does not exist 
30.00 


33.35 


Use the following data (also presented elsewhere) to determine these same measures of 
central tendency and location for Rome: 


Statistical Measures 


Rome Data: 18, 25, 23, 33, 31, 19, 26, 28, 42, 32, 27, 22, 7, 13, 46, and 3 


Rome Data 
Rank Order Position 


Results 


Statistical Measures 


Does not exist 
25.50 
24.69 


Median 8.50 
Mean NA 

Note: The answers given for the Madrid data above were obtained without any rounding. The values 
for the mean and the standard deviation were calculated first and then stored into memory cells (such 
as storing the mean in memory location 7 and storing the mean in memory location 8). The subsequent 
answers for the z-scores, for 2-sigma boundaries, and for 3-sigma boundaries will agree with those 
derived from any computer software calculations. However, if the mean and standard deviation have 
been rounded (as they were above when creating the table), and the rounded values are used to 


perform subsequent calculations, answers may differ. The answers for this case study may be slightly 
different, at perhaps the second decimal place. 
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Module 6 
Case Study 2 Answers (Cont'd) 


Using the percentile bar for Madrid as a guideline and as an anchor, a percentile bar for Rome 
has now been placed just to the right of the Madrid percentile bar. 


Madrid 


Based on the measures of central tendency, the measures of location, and the percentile 
bars, you might draw conclusions about these data such as: 


The mean for Madrid is greater than the mean for Rome. 


2. The values of Pgo, P75, Pso (i.e., the median), P25, and P19 for Madrid all exceed their 
corresponding values for Rome. 


3. The location of the percentile bar for Madrid is higher on the graph than for Rome. 
4. The percentile bar for Madrid is longer (i.e., taller) than for Rome. 


5. Generally speaking, the distribution of bonus points for Madrid is considerably different 
than for Rome. 


6. On average, the sample of sales representatives in Madrid is obtaining more bonus points 
than the sample in Rome. 


7. The next logical question becomes “Why?” This question will be addressed in later case 
study components. 
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Module 7 
Case Study 3 Answers 


In Module 7, measures of variability were discussed, as well as z-scores, the 2-sigma rule, 
and the 3-sigma rule. As a review, here are findings for the Madrid data, illustrating many of 
the concepts discussed in that module. These include measures of variability, z-scores, 
boundaries for the 2-sigma rule, boundaries for the 3-sigma rule, and a graphical example of 
the 2-sigma rule as applied to Madrid: 


Madrid 


Statistical Measures 


Interquartile Range 


Standard Deviation 


z-score for a raw score of 63 


z-score for a raw score of 9 


Lower boundary for the 2-sigma rule 


Upper boundary for the 2-sigma rule 


Lower boundary for the 3-sigma rule 


Upper boundary for the 3-sigma rule 


1.68 17.51 33.35 49.19 65.02 
-—2 SD -1 SD 0SD +1 SD +2 SD 


GR2 g CS Answers.6 


© WorldatWork. All rights reserved. 


Module 7 
Case Study 3 Answers (Cont'd) 


1. Use the Rome data to fill in the table below regarding various statistical values: 


Rome 
Statistical Measures | Results 
Range 43.00 
Interquartile Range 12.50 
Standard Deviation 11.38 
z-score for a raw score of 46 +1.87 
z-score for a raw score of 3 —1.91 
Lower boundary for the 2-sigma rule 1.93 
Upper boundary for the 2-sigma rule 47.44 
Lower boundary for the 3-sigma rule -9.44 
Upper boundary for the 3-sigma rule 58.82 


Note: As in Case Study 2, the answers may be slightly different. 


2. What observations might you make when comparing Madrid to Rome? Base your 
conclusions on the measures of variability, the 2-sigma rule, and the 3-sigma rule. 

3. Does 63 bonus points in Madrid “statistically” appear to be an outlier? 
No. The z-score of +1.87 associated with 63 bonus points in Madrid is within the 2-sigma 
limits. 

4. Does 9 bonus points in Madrid “statistically” appear to be an outlier? . 
No. The z-score of —1.54 associated with 9 bonus points in Madrid is within the 2-sigma 
limits. 

5. Does 46 bonus points in Rome “statistically” appear to be an outlier? 
No. The z-score of +1.87 associated with 46 bonus points in Rome is within the 2-sigma 
limits. 

6. Does 3 bonus points in Rome “statistically” appear to be an outlier? 
No. The z-score of —1.91 associated with 3 bonus points in Rome is within the 2-sigma 
limits. 

7. Why is the lower boundary for the 3-sigma rule a negative value for Rome? 


Subtracting three standard deviations from the Rome mean results in a value that falls below 
zero. Of course, there are no bonus points below zero for Rome. However, the 3-sigma rule 
still holds. That is, at least 88.89% of the cases fall between 0 and the upper 3-sigma 
boundary. 


GR2 = CS Answers.7 


© WorldatWork. All rights reserved. 


Relative Frequency % 


Module 8 
Case Study 4 Answers 


In Module 8, shapes of distributions were discussed. As can be seen in the percentile bars, the 
distribution for Madrid appears to be positively skewed (skewed to the right). This is because the 
mean is to the right of the median (bigger than the median). Also, the distance between the 
median and P7s is greater than the distance between the median and P25. On the other hand, an 
examination of the percentile bar for Rome does not reveal much skewness at all. We might be 
tempted to conclude that Rome is negatively skewed, 
since the mean is less than the median. Yet, the 
distance between the mean and median in Rome is 
very small. Further, the percentile bar and the bar 
graph for Rome are much more symmetric than those 
for Madrid. We must remember, however, that just 
because these graphs for Rome appear symmetric, 
we should not conclude that the distribution is 
“normal.” Notice that while the bar graph for Rome is 
“symmetric,” the “middle” bar is much higher on the 
graph than would have occurred if the distribution 


were “normal.” 
Madrid 


Madrid Data: Interval Widths of 10 
(Starting at the Origin) Rome Data: Interval Widths of 10 


Converted to Relative Frequency % (Starting at the Origin) 


Relative Frequency % 


45 145 245 345 445 545 645 5 45 245 345 445 545 64.5 


Bonus Points Bonus Points 


Madrid and Rome Data Combined 


10% 


Relative Frequency % 


0% 


45 145 245 345 445 S45 645 


Bonus Points 
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Module 9 
Case Study 5 Answers 


In order to address the questions for Madrid and Rome, you should first plot the data. Here 
are the graphs for the two locations: 


Bonus Points 


Madrid: Experience and Bonus Points Rome: Experience and Bonus Points 
70 50 
|e 45;———- | ° 
__ 2 
.— 
Oo 
o 
n 
=) 
Cc 
[e) 
co 
(0) _ eeewrners eens (eee 1 eae } j : ‘ — es eee (eee 
0 20 40 60 80 100 120 140 160 180 0 20 40 60 80 100 120 140 
Months of Experience Months of Experience 


A. The following table provides responses to each of the seven questions regarding the 
relationship between months of experience and bonus points for Madrid and Rome: 


Key Questions Madrid Rome 


Yes (but not as strong 
? 
. ls there a relationship? as for Madrid) 
. ls it perfect? No No 


. Are there any outliers? ; . 
y use z-scores to examine) | use z-scores to examine) 


Not obvious (but you can | Not obvious (but you can 
u 


4. lf there is a relationship, is the 


relationship linear? Yes 
. If it is linear, is it positive or negative? Positive Positive 
. If it is nonlinear, what is the nonlinear F . 
pattern? Not applicable Not applicable 
. Does the graph reflect what you Maybe Maybe 


expected? 


B. The following statistical values are obtained for Madrid and Rome: 


Statistical Values 


Correlation coefficient +0.8005 +0.4004 


Coefficient of determination 0.6407 0.1603 
Intercept 16.4688 17.1968 


Slope 0.2923 0.1300 
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Module 9 
Case Study 5 Answers (Cont'd) 


Given that a new employee in Madrid has 120 months of experience as a sales 
representative, how many bonus points would you expect this person to earn, based 
on your regression model? 


y’ = 51.55 


Given that a Madrid sales representative has earned 60 bonus points, how many 
months of experience would you expect for this person, based on your regression 
model? 


x’ = 148.92 


Given that a new employee in Rome has 120 months of experience as a sales 
representative, how many bonus points would you expect this person to earn, based 
on your regression model? 


y’ = 32.80 


Given that a sales representative in Rome has earned 40 bonus points, how many 
months of experience would you expect for this person, based on your regression 
model? 


x’ = 175.42 


Suppose you hire a new employee in Rome. The person has 25 years of experience 
as a sales representative prior to joining the organization. How many bonus points 
might you expect this person to receive over the same time period? 


You might predict they would earn 56.19 bonus points, based on the linear model for 
Rome. However, you should be cautious about several issues that limit your ability 
to make such conclusions: 


a. The correlation between months of experience and bonus points for Rome is not 
very strong. In fact, only approximately 16% of the variation in bonus points can be 
explained by months of experience. 


b. The sample size for Rome is only 16. You should have “at least 10 data points for 
each predictor variable, plus 10 for the pot” = (10 x 1) + 10 = 20 data points before 
relying too heavily on any conclusions that might be made from these data. 


c. The x-value of 25 years (i.e., 25 x 12 = 300 months) of experience is well beyond 
the original range of data collected for this example. In fact, the most experienced 
sales representatives in the sample from Rome only had 132 months of 
experience. Therefore, you should not expect the regression model to be very 
reliable when using such an extreme x-variable in an attempt to predict a y-value. 
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Module 1 Quiz Answers 


ak WN = 
0 Os © © ®) 


GR2 @ APP.2 


© WorldatWork. All rights reserved. 


Module 1 Practice Questions 
Answers 


1. Determine the level of measurement for each of the following items. Place N next to 
each nominal measurement, O next to each ordinal measurement, I next to each 
interval measurement and R next to each ratio measurement. 


N Religious classification (Protestant, Catholic, Moslem, Jewish, Other, None) 


| A list of temperatures in European capitals, based on Celsius temperature scale 


O__ College class standing (freshman, sophomore, junior, or senior) 


R__ A\list of the ages of your senior managers 


N _ Political party affiliations (e.g., Republican, Democrat, Libertarian, 
Independence, Green, Peace and Freedom, etc.) 


ye) 


Heights of machinists 


Rank order of the heights of a group of machinists 


O 


R Weights of suitcases stored on an airplane 


Job performance as measured by the number of parts produced 


v8) 


OQ __ Job performance as measured by a rating scale 


Zz 


Gender (male or female) 


R Months of experience in remuneration and/or benefits 


Number of previous college-level statistics courses completed 


D 


| Ideal room temperature (in Fahrenheit) 


O _ Anxiety level toward T3 on a 10-point scale, with 1 representing very 
comfortable and 10 representing very anxious 


R Number of employees in the HR department 


R Number of employees serviced by the HR department 
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Module 2 Quiz Answers 


4. B 


You are given information as to the percentages of the budget for the 
three years, but no information of how much the budget is for each year. 
Therefore, it cannot be determined based on the information given. 


Compa-ratio = palate = 25000 = 0.83 
Midpoint 30,000 


Salary 


Percent adjustment ——“ - 
My Salary 


_ 60,000 _, 


50,000 
= 1.20 -1 
= 0.20 


Midpoint 
F +( Desired Range Spread ) 
2 


Minimum 


72,000 
1.25 


= 57,600 


Maximum = Minimum x (1 + Desired Range Spread) 
= 57,600 x (1 + .50) 
= 57,600 x 1.50 
= 86,400 
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Module 2 Quiz Answers (Cont'd) 


5. B 50,000 x .08 = 4,000 


6:-.B 4,000 x .10 = 400 (10% return) 
4,000 + 400 = 4,400 


7. -G 2 x 10 hours = 20 hours 
3x 7 hours= 21 hours 


41 hours 


George = 20/41 = .4878 = 48.8% 
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Module 2 Practice Questions Answers 


1. B 660 Hourly 
420 Salaried part-ti 420 
alaried part-time - 30 or 30% 
320 Salaried full-time 1,400 
1,400 


2,6 12% x 9,500 = .12 x 9,500 = 1,140 


3: 3B No, 800 employees on Dec. 31, 960 in July 
1,000 — .20(1,000) = 800 on Dec. 31 
800 x 1.20 = 960 in July 


The reason it is not the same is because the base changes from 1,000 to 800. 


4. A Compa-ratio = S22 = 1550 -g 998 Market index = 12°? = 0.987 
Midpoint 1,670 1,570 
oF 6,000 = 0.075 ora 7.5% increase 
80,000 


Maximum — Minimum 


6. D Salary range spread Rainicauina 


7,200 — 4,800 
4,800 


I 
on 
o 
on 
S 
oS 
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Module 2 Practice Questions Answers 
(Cont'd) 


Midpoint 
Desired Range Spread ) 


7. B Minimum = 
1+ ( 
2 


| 
— 
e2) 
© 
oO 


Maximum = Minimum x (1 + Desired Range Spread) 


1,600 x (1 + .50) 


= 1,600 x 1.50 


2,400 


8. C 2,000 x 1.40 = 2,800 2,000 
+ 2,800 
4,800 + 2 = 2,400 
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Module 2 Practice Questions Answers 
(Cont'd) 


9. D Minimum = Midpoint — (Maximum — Midpoint) 


950 — (1,200 — 950) 


950 — 250 
= 700 


Maximum — Minimum 


Range spread — 
Minimum 


1,200 — 700 
700 


500 
700 


= 0.71 = 71% 
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Module 3 Quiz Answers 


i Compound interest should be used to determine the amount of interest 
earned. 


a3 477.54 


Year Principal Interest Total 
1 (2,500.00) x (.06) 150.00 2,650.00 
2 (2,650.00) x (.06) 159.00 2,809.00 
3 (2,809.00) x (.06) 168.54 2,977.54 


2,977.54 — 2,500.00 477.54 


t ff t 


future present interest 
value value earned 


3..iD 2,949.51 


Present value = 1,800 

Interest rate = 10% per year, compounded quarterly 
Quarterly interest rate = 2.5% 

Number of periods = 20 (4 quarters per year for 5 years) 


Select percent change/compound interest worksheet 
Clear the worksheet 

Enter present value (1,800) 

Enter interest rate (10/4 = 2.5) 

Enter total number of periods (20) 


OFS ON 


Compute future value 2,949.51 
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Module 3 Quiz Answers (Cont'd) 


4. B 11,841.14 


Future Value = 15,000 

Number of Periods = 8 (4 quarters per year for two years) 
Interest Rate = 12% per year, compounded quarterly 
Quarterly Interest Rate = 3% 


Select percent change/compound interest worksheet 
Clear the worksheet 

Enter ending (new) future value (15,000) 

Enter total number of periods (4 x 2 = 8) 

Enter interest rate (%CH) (12/4 = 3) 

Compute (old) present value (11,841.14) 


a er a 


5. A 1.02 x 1.025 x 1.03 = 1.076865 
1.03 x 1.025 x 1.02 = 1.076865 
1.076865 x 12.50 = 13.46 


72 
6. A Years to double = —_—_—__—_——_ 
Growth Rate 
= 72/7.3 
= 9.86 years 
7. C #PD = 3 
New = 2,698 
Old = 7,000 
%CH = 10.5 
8. B 
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Module 4 Quiz Answers 


ja Since this is for all program participants, it is a population. 


2. B It is usually impractical in terms of both time and expense to collect data for an 
entire population. 


3. D 


Module 5 Quiz Answers 


AS VB 
2;..D 


Module 6 Quiz Answers 


Rank order the data. 13 is the value that occurs the most. 
Rank order the data. 9 is the middle value. 


D 
B 
B 
A 


pe: oe", 


Module 7 Quiz Answers 


(3,000 — 3,500)/200 = —500/200 = —2.5 


oR WwW DYNO =| 
00N0 0 DW 


Your z-score is (19 — 16)/1 = 3; your sister’s z-score is (13 — 16)/1 = -3. 
According to the 3-sigma rule, at least 88.9% of the seniority for employees of 
this company will fall between 13 and 19. 
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Module 8 Quiz Answers 


0 $F O 


Module 9 Quiz Answers 


Point A (1, 15) and Point B (4, 30). To find the slope, use (y,— y,)/(x, — X;,). 
(30 — 15)/(4-1) = 15/3 =5 


1. 


A 


2. C 


> 


10. 


© MO NODA 


?) 


0 UU +> > 


There appears to be a positive linear relationship; annual pay increases as 
experience increases. 


r = +1 indicates a positive slope. It also indicates a perfect correlation. 


y’ = 15,000 + 50x 
y’ = 15,000 + 50x 
y’ = 15,000 + 50x 


y’ = 15,000 + 50x 


y = intercept + slope x 15,000 = intercept 
y =intercept+ slope x 50=slope 


If x = 500, 
y’ = 15,000 + (50)(500) = 40,000 


If y = 42,500, 
42,500 = 15,000 + (50)(x) 
(42,500 — 15,000)/50 = 550 = x 
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Formulas 


Certain formulas are required on the test. However, you do not have to memorise 
them. They will be handed out with the test. These formulas are shown below. 


Population Standard Deviation 


Compound Interest 


FV = PV(1+i)" 


FV = future value 
PV = present value 
i = interest rate per period 


N = number of periods 
Percentiles 
Step 1: 


v=[(x + 100) x n] + [1 —(x + 100)] 


Step 2: 
Percentile value = Lower + 
[(Higher — Lower) x Decimal remainder] 


Correlation Coefficient 


Sample Standard Deviation 


Salary Structure 


Midpoint 


7, DesiredRange Spread 
2 


Minimum = 


Maximum = Minimum (1+ Range Spread ) 


Standard Score or z-score 


Xx-—X 
Z = 
Ss 
Z = z-score 
x = data value 
xX = mean 
s = standard deviation 


Least Squares Line Through n Points 


___SIlx-x)(y-9)) y=a+ x 
i =(x-%)? E(y-¥)? Sipe 
Xo —% 
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GREATER CINCINNATS COMPENSATION 
AND BENEFITS ASSOCIATION 


Statistics in Compensation 
and Benefits 
Made EASY with Excel! 


Presented by Julie A. Elliott, CCP 
Greater Cincinnati Compensation and Benefits Association 


— 


Essence eee ee 


Reprinted with permission from Greater Cincinnati Compensation and Benefits Association, 
Julie A. Elliott, CCP; © 2005. 
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Agenda 


m Statistics review 
= Measures of central tendency 
= Measures of location 
= Measures of variability 
= Practical applications 
= Percentile bar 
= Pay structure design 
= Range penetration 
= Merit matrix model 
m Regression analysis 


ilk, GCCBA. 


Statistics Review 


OLESEN Eee ES ESBS EEE SE EEE SEES SHES EE EE ee Ee EE eae 
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Statistics Review 


m Measures of central tendency 
m= Mean (average) 
= Mode (point that occurs most often) 
= Median (50 percentile) 


= Measures of central tendency give you 
information about what is happening in the 
middle of the data 


Statistics Review 


Monthy 


m Measures of central tendency Healthcare 


Company Premiums 
m Example — Health Care A 164 
Premiums 


175 
152 
135 
168 
182 
143 
146 
164 
182 
127 
138 
133 
172 


AC-—LOMMVO OW 


141 
153 
164 
139 


AODOVOSOZEr 
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Statistics Review 


Monthy 

m= Measures of central tendency Healthcare Rank 
Company Premiums Order 

182 20 
182 19 
175 18 
172 17 
168 16 
164 15 
164 14 
164 13 
164 12 
153 11 
152 
146 
143 
141 
140 
139 
138 
135 
133 
127 


m Step 1: Rank order from 
highest to lowest 
m Highlight data 
= Data 
= Sort 
m= Descending 


F 
J 
B 
N 
F 
A 
O° 
R 
Q 
S 
H 
G 
Pp 
T 
Ss 
L 
D 
M 
K 


FNWORUANWDOS 


Statistics Review 


Monthy 

m= Measures of central tendency Healthcare Rank 
Company Premiums Order 

182 
182 
175 
172 
168 
164 
164 
164 
164 
153 
152 
146 
143 
141 
140 
139 
138 
135 
133 
127 


m Step 2: Insert function 


Average 154.1 =average(B2:B21) 
Mode 164.0 =mode(B2:B21) 
Median 152.5 =median(B2:B21) 


ZFCr-mnAHVATOAOOWDO-YPmMZ2z0e7 
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Statistics Review 


=m Concept Check! 
m What is the difference between median and 
average? 


Statistics Review 


m Concept Check! 
m What is the difference between median and 
average? 
= The average sums up all the data points and 
divides by the total 
= Average will change more with extreme values 


= The median is the point where half of the data 
falls below — middle-ith point so it is not 
impacted as much by extreme values 
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Statistics Review 


Monthy 


. Concept Check! Healthcare Rank 
Company Premiums Order 


g VVhat is the difference between s00 30 
median and average? 5 


175 
172 
168 
164 
164 
164 
164 
153 
152 
146 
143 
Average =average(B2:B21) 141 

Mode =mode(B2:B21) 8 


Median =median(B2:B21) ; 138 


135 
133 
127 


oe 
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ih, GREATER CINCINNAT! COMPENSATION 
‘AND BENEFITS ASSOCIATION 


Statistics Review 


Measures of Location 


III A EE I 
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Statistics Review 


= Measures of Location 
= Percentiles 
mw 10%, 25th 50%, 75th, goth 
= The point where x% of data is below 
= Percentile is an ill-defined function 


Statistics Review 


Monthy 


m= Measures of Location ae ae 
™ Technically, 50% of the Sompany premiums... tet 
data is below: 
15235 
152.6 
152.4 
1621 


OWDmO-PmMmZ2z0e7 


Median 
[- 


50% < 


AZUrannivdséexwco 


SP NWHhOAN @ 
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Statistics Review 


= Measures of Location Ds ee er 
Company Premiums Order 
mw Excel formula 182 
182 
175 
172 
168 
164 
164 
164 
164 
153 
152 
146 
143 
141 
140 
139 
138 
135 
133 
127 


10th %ile =percentile($B$2:$B$21,. 
25th %ile =percentile($B$2:$B$21,. 
50th %ile =percentile($B$2:$B$21,. 
75th %ile =percentile($B$2:$B$21,. 
90th %ile =percentile($B$2:$B$21,. 


F 
J 
B 
N 
E 
A 
| 
oO 
R 
Q 
Cc 
H 
G 
p 
T 
S 
L 
D 
M 
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GREATER CINCINNAT! COMPE 
AND BENEFITS ASSOCIATE 


Statistics Review 


ca 


Measures of Variability 


[cE 


SLE EES ES See 
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Statistics Review 


m= Measures of Variability 
= Range 
m Interquartile range 
= Standard deviation 


= Variability tells how dispersed the data is 


Statistics Review 


Monthy 


zs Meeeu es of Variability Healthcare 
a Range Company Premiums 


= Highest value — lowest value 


182 
175 
172 
168 
164 
164 
164 
164 
153 
152 
146 
143 
141 
140 
139 
138 
135 
133 


a 182-127 
m55 


REZUrTnHNAVOTONOOWD,O-YFMZ20e7 
PNW hOON © 
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Statistics Review 


= Measures of Variability Paice ms 
a interquartile range nei ant ood ea 
mw 75% %ile — 25th %ile 
mw 165.0 — 139.8 = 25.2 75" %ile 
a 75" %ile ignores top 25% 


mw 25 %ile includes bottom 
25% of data 

= Interquartile range shows 
us the middle 50% of the 25". 
data 


PmZzoen 


AUVOIOOWDO— 


Statistics Review 


= Measures of Variability i teeinan meee 


Company Premiums Order 


= Standard deviation F 182 
182 
= Population = 


175 
Ve Oex)2/n 
= Sample = 


Vs: (x-x)2/(n-1) 


164 
164 
164 
164 
153 
152 
146 
143 
141 
140 
139 
138 
135 
133 
127 


Standard Deviation 
Sample 17.01 = STDEV(B2:B21) 
Population 16.58 = STDEVP(B2:B21) 


ASOT NAVOATOIOODO-YPmMm20e 


SPN WLR ODN OO 
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Statistics Review 


Monthy 


m= Measures of Variability Heatihcare: Rarik 
. z Company Premiums Order 
= Standard deviation ~~. 0 


182 20 
= How do you use it? i75 i8 
» Identify outliers 
=» Compare data sets 


172 17 
168 16 
164 15 
» Z score: formula to 
determine how many 
standard deviations one 


164 
164 
point is from the mean 


164 
153 
152 
146 
143 
141 
140 
139 
138 
135 
133 
127 


F 
J 
B 
N 
E 
A 
| 
oO 
R 
Q 
Cc 
H 
G 
p 
els 
Ss 
L 
D 
M 
K 


<Cercereere ey 
FNWBRAOADANWOO ANY ®WDR 


Statistics Review 


m Measures of Variability Pion ie ten 


= Standard deviation Company Premiums Order 
182 20 
m= How do you use it? ee . 
= 2 sigma rule: 75% 172 17 

= 3 sigma rule: 88.9% 


168 16 
164 15 
164 14 
164 13 
164 12 
153 1 
152 10 
146 
143 
141 
140 
139 
138 
136 
133 
127 


Average = 154.1 
+2x 17.01 = 188.12 
- 2x 17.01 = 120.08 


At least 75% of the data will fall 
between these 2 points 


ARZOUrNAVOATLTAOOWDO-YFmM2Z2z20e7 
o 


SP NWAODN Ow 
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ibhy, GREATER CINCINI OMPENSATION 
AND BENEFIT CIATION 


Practical Applications 


Percentile Bar 


SS SE 


Percentile Bar 


m= Turn numbers into a picture 


Survey Salary Data 


Sr Accountant $43,500 $49,000 $52,000 $56,500 $61,200 
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Percentile Bar 


Survey Salary Data 


Sr Accountant $43,500 $49,000 $52,000 $56,500 $61,200 


SrAccountant $43,500 $5,500 $3,000 


25%ile — 5otre%ile — || 75%ile — || 9Ot%%ile — 
10h%ile D5thopile 50%%ile 


75’ %ile 


Percentile Bar 


Chart Wizard - Step 1 of 4 - Chart lype 


Standard Types | Custom Types | 


m@ Highlight data 
Chart type: Chart sub-type: 


Picuen FF m@ Insert 
Er Bar i 

lex Line : lel m@ Chart 
acme | = , 5 |_| Stacked bar 
bd Area i L 

@ Doughnut 
gar Radar 
ae) Surface 


® 3 Bubble 


lagg Stock 


Stacked Column, Compares the 
contribution of each value to a total across 
categories, 


sessted 


= i 
ire 


GR2 = RES.17 


© WorldatWork. All rights reserved. 


Percentile Bar 


$50,000 So 


Format y-axis, no fill color — 


change minimum 
to $40,000 


$10,000 


Sr Accountant 


Percentile Bar 


90th%ile 


7T5"%ile 


50h%ile 


25%ile 


10%%ile 


$40,000 


Accountant 
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Percentile Bar 


Survey Salary Data 
10th%ile 25th%ile 50th%ile 75th%ile Y9O0th%ile 
$39,300 $41,200 
Grade 9 $42,300 $45,100 $48,900 $53,400 
Grade 10 $43,500 $49,000 $52,000 $56,500 $61,200 
Grade 11 $55,300 $57,800 $60,300 $64,500 $68,700 
Grade 12 $57,900 $64100 $69,900 $74,000 $79,100 


10th%ile 25th%ile 50th%ile 75th%ile Q9Oth%ile 
Grade 8 
Grade 9 $40,800 
Grade 10 $43,500 
Grade 11 $55,300 
Grade 12 $57,900 


Percentile Bar 


$80,000 + 


Grade 11 
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Percentile Bar 


Market Grade 8 
Current Grade 8 
Market Grade 9 
Current Grade 9 
Market Grade 10 
Current Grade 10 
Market Grade 11 
Current Grade 11 
Market Grade 12 
Current Grade 12 


10th% ile 
$35,600 


$40,800 
$43,500 
$55,300 


$57,900 


25th%ile 5Oth%ile 75th%ile 9O0th%ile 


$1,600 
$2,800 
$3,000 
$2,500 


$5,800 


Percentile Bar 


$1,900 
$3,800 
$4,500 
$4,200 


$4,100 


Minimum 
$33,333 
$38,333 
$44,083 
$50,696 


$58,300 


Midpoint 
$6,667 
$7,667 
$8,817 
$10,139 


$11,660 


Maximum 


$6,666 

$7,666 

$8,816 
$10,139 


$11,660 


8 


Esl 


8 


9 


9 
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Market Grade Current Grade Market Grade Current Grade Market Grade Current Grade 


10 


Market Grade Current Grade Market Grade 


a 1 
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12 


i 
Pay Structure Design | 


SELENE SEEGER Eee ES ESSERE EEE ESE EE a Ee ee eee a SE Ee eS ee SS Sea EE ee ste 


Pay Structure Design 


m Example: Traditional Pay Structure 
m 12 pay grades 
= Midpoint for lowest grade is $18,000 
= Grade to grade increment is constant 15% 


= 30% range spread for first 4 grades, 40% range 
spread for middle 4 grades and 50% range 
spread for top 4 grades 
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Pay Structure Design 


m Example: Traditional Pay Structure 


Minimum Midpoint Maximum 
Grade 1 $18,000 


Grade 2 === = Round(62"1.15.0) ) 
Grade 3 


Grade 4 
Grade 5 
Grade 6 
Grade 7 
Grade 8 
Grade 9 
Grade 10 
Grade 11 
Grade 12 


Pay Structure Design 


m Example: Traditional Pay Structure 


Minimum Midpoint Maximum 
Grade 1 $18,000 
Grade 2 $20,700 
Grade 3 $23,805 
Grade 4 $27,376 
Grade 5 $31,482 
Grade 6 $36,204 
Grade 7 $41 635 
Grade 8 $47,880 
Grade 9 $55,062 
Grade 10 $63,321 
Grade 11 $72,819 
Grade 12 $83,742 
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Pay Structure Design 


m Example: Traditional Pay Structure 


Minimum Midpoint Maximum 
Grade 1 $18,000 
Grade 2 $20,700 


Minimum = Midpoint/ 
(1+1/2 x range 


spread) 
= Round(C2/1.15,0) 


Grade 5 $31,482 Minimum = 
Grade 6 $36,204 
Grade 7 $41 635 18,000/ 1:15 
Grade 8 $47 880 
Grade 9 $55,062 
Grade 10 $63,321 
Grade 11 $72,819 
Grade 12 $83,742 


Pay Structure Design 


m Example: Traditional Pay Structure 


Minimum Midpoint Maximum er: = é F 
Grade1 $15,652 $18,000 Minimum = Midpoint/ 


Grade2 $18,000 _—‘ $20,700 (1+1/2 x range 
Grade 3 $20,700 $23,805 spread) 

Grade 4 $23,805 _—«$27,376 

Grade 5 $31,482 Minimum = 
Grade 6 $36,204 


Grade 7 $41,635 31,482/ 1.20 
‘ = Round(C6/1.20,0) 
Grade 10 $63,321 


Grade 11 $72,819 
Grade 12 $83,742 
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Pay Structure Design 


m Example: Traditional Pay Structure 


Minimum Midpoint Maximum 2 nt 
Grade1 $15,652 $18,000 Maximum = Minimum 
Grade2 $18,000 $20,700 x (1+ range spread) 
Grade 3 $20,700 $23,805 
Grade 4 , a 
Grades $2624 natB2X190.0) J Maximum = 15,652 x 
Grade 6 $30,170 $36,204 1.30 
Grade7 $34,696 $41,635 
Grade8 $39,900 $47,880 
Grade9 $44,050 $55,062 
Grade 10 $50,657 $63,321 
Grade 11 $58,255 —«- $72,819 
Grade 12 $66,994 $83,742 


Pay Structure Design 


m Example: Traditional Pay Structure 
Range Grade 
Grade Minimum Midpoint Maximum Spread Increment 

$15,652 $18,000 $20,348 300% 15.0% 
$18,000 $20,700 $23,400 300% 15.0% 
$20,700 $23,805 $26,910 300% 15.0% 
$23,805 $27,376 $30,947 30.0% 15.0% 
$26,235 $31,482 $36,729 400% 15.0% 
$30,170 $36,204 $42,238 400% 15.0% 
$34,696 $41,635 $48,574 400% 15.0% 
$39,900 $47,880 $55,860 400% 15.0% 
$44,050 $55,062 $66,074 500% 15.0% 
$50,657 $63,321 $75,985 50.0% 15.0% 
$58,255 $72,819 $87,383 500% 15.0% 
$66,994 $83,742 $100,490 500% 
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AND BENEFITS ASSOCIATION 


Practical Applications 


Range Penetration 


‘ 
i 


|S LOIUOIDRE Lao eee ee eee ee ee eee ee eS a ee ee aD eS EES EE 


Range Penetration Chart 


m Help managers make pay decisions 
m Help reduce focus on midpoint 
m Encourage use of full pay range 


m Example: Single Department 
= Traditional pay structure 
= Divide into thirds 
= Common review date 
= 4 point rating scale (4 highest, 1 lowest) 


GR2 # RES.25 


© WorldatWork. All rights reserved. 


Insert | Format Tools Data Window Help 


Gaus m@ Assign a name to 
Columns the pay structure 


Worksheet . P 
“ATCT Be cee... ~poop oe & Highlight the pay 
Grade Minin « Eunction... win 4m structure (without 
headers) 
i Picture Paste... 
Hyperlink... Ctrl+k Create... a Insert 
$93 bogermwe a ooo Apply... = Name 
$26 235 $31 A482 t. Label... . 
$30170 $36204 $42738 = Define 
$34 696 $41 635 $48 574 = “Structure” 
$39 900 $47 880 $55,860 
$44 050 $55 062 $66 074 
$50,657 $63 321 $75,985 


©O 8 = OG hm & t 


— 
o 


Range Penetration Chart 
a C n= 
Performance Current Pay 

Employee Service Rating Pay Grade 
Michael 4 $24,000 4 
Erin $17,200 1 
Alex $31,000 5 
Stephanie $46,500 7 
Christy $47,000 9 
Sophia $89,000 2 
Natalia $52,000 8 
Mike $26,800 4 


Use Pay Grade to lookup the 
minimum and maximum of the 
pay grade 
Min = vilookup(E2,Structure,2) 
Max = viookup(E2, Structure,4) 
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Range Penetration Chart 


Performance Current Pay 
Employee Service Rating Pay Grade Minimum Maximum 
Michael 6 4 $24,000 $23,805 $30,947 
Erin $17,200 $15,652 $20,348 
Alex $31,000 $26,235 $36,729 
Stephanie $46,500 $34,696 $48,574 
Christy $47,000 $44,050 $66,074 
Sophia $89,000 $66,994 $100,490 
Natalia $52,000 $39,900 $55,860 
Mike $26,800 $23,805 $30,947 


Range Penetration Chart 


Performance Current Pay 
Employee Service Rating Pay Grade Minimum Maximum 
Michael 4 $24,000 $23,805 $30,947 
Erin $17,200 $15,652 $20,348 
Alex $31,000 $26,235 $36,729 
Stephanie $46,500 $34,696 $48,574 
Christy $47,000 $44,050 $66,074 
Sophia $89,000 $66,994 $100,490 
Natalia $52,000 $39,900 $55,860 
Mike $26,800 $23,805 $30,947 


m Range Penetration Formula 
(Current Pay — Minimum)/(Maximum — Minimum) 
= (D2-F2)/(G2-F2) 
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Range Penetration Chart 


Perf. Current Pay Range 
Employee Srvc Rating Pay Grade Minimum Maximum Penetration 
Michael 4 $24,000 4 $23,805 $30,947 3% 
Erin $17,200 1 $15,652 $20,348 33% 
Alex $31,000 5 $26,235 $36,729 45% 
Stephanie $46,500 7 $34,696 $48,574 85% 
Christy $47,000 9 $44,050 $66,074 13% 
Sophia $89,000 2 $66,994 $100,490 66% 
Natalia $52,000 8 $39,900 $55,860 76% 
Mike $26,800 4 $23,805 $30,947 42% 


Range Penetration Chart 


m Use “&’ to string together fields: 
= Name, service, performance rating 
= A28&’” Service: “&B28&” Rating: “&C2 


Perf. Current Pay Range 
Employee Srvc Rating Pay Grade Minimum Maximum Penetration 
Michael 4 $24,000 $23,805 $30,947 3% 
Erin $17,200 $15,652 $20,348 33% 
Alex $31,000 $26,235 $36,729 45% 
Stephanie $46,500 $34,696 $48,574 85% 
Christy $47,000 $44,050 $66,074 13% 
Sophia $89,000 $66,994 $100,490 66% 
Natalia $52,000 $39,900 $55,860 76% 
Mike $26,800 $23,805 $30,947 42% 
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Range Penetration Chart 


Range — sm Highlight data 


Employee, Srv, Rating Penetration 

Michael Service: 6 Rating: 4 3% m@ Insert 
Erin Service: 2 Rating: 3 33% 

Alex Service: 7 Rating: 3 45% @ Chart 
Stephanie Service: 5 Rating: 2 85% 

Christy Service: 1 Rating: 3 13% 

Sophia Service: 3 Rating: 4 66% 

Natalia Service: 15 Rating: 3 76% 

Mike Service: 9 Rating: 1 42% 


Range Penetration Chart 


Standard Types | Custom Types | 


Chart type: Chart sub-type: 


lad Colurnn i ee 
Joe Line 


@ Pie 


2 XY (Scatter) 1 
@ Doughnut / | 


far Radar 
a Surface 
@: Bubble 


Fat Stack 


Clustered Bar, Compares values across 
categories. 


Press and Hold to view Sample 
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Range Penetration Chart 


Range Penetration 


Mike Serice: 9 Rating: 1 


Natalia Service: 15 Rating: 3 
Sophia Service: 3 Rating: 4 | 


Christy Senice: 1 Rating: 3 
| PRange Penetration| 


Stephanie Servce: 5 Rating: 2 | a. - _ S | 


‘Remove legend 


Alex Senice: 7 Rating: 3 
Erin Service: 2 Rating: 3 [ a S | 
Michael Service: 6 Rating: 4 0 


0% 


Format x axis 


Range Penetration Chart 


3 3 
Patterns Scale = Font =|) Number | Alignment | 
Value (V) axis scale 
Auto 
Mipinurn: 


Maximum: 


Major unit: [-333] 
Minor unit: o.02 


Category (¥) axis 
Crosses at; ia] 


Display units: 
P™ Logarithmic scale 


P Yalues in reverse order 
P Category (*) axis crosses at maximum value 
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Range Penetration Chart 


| Format Data Series 


Patterns | Axis | YError Bars Data Labels | Series Order | Options | 


Label Contains Click on bar 


™ Category name Format data series 


i” Value 
~ Data labels 


Select “Value” 
Separator: | 7 


P™ Legend key 


Range Penetration Chart 


Range Penetration 


Mike Service: 9 Rating: 1 
Natalia Service: 15 Rating: 3 fo 


Sophia Service: 3 Rating: 4 


Christy Service: 1 Rating: 3 f 


Stephanie Service: 5 Rating: 


Alex Service: 7 Rating: 3 


Grin Service: 2 Rating: 3 


Michael Service: 6 Rating: 4 He 


0% 
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Range Penetration Chart 


Minimum 


Mike Serice: 9Rating 1 fo ee ]42% 


Natalia Service: 15 Rating: 3 


Sophia Senice: 3 Rating: 4 


Christy Senice: 1 Rating: 3 


Stephanie Service: 5 Rating 


Alex Senice: 7 Rating: 


Service: 2 Rating: 


Michael Senice: 6 Rating: Lr] 3% 


0% Still progressing 33% Fully competenent Exceptional 

Limited experience Fully capable Consistently exceeds 
New to job Moderate experience Significant expenence 

Recently promoted Solid contributor Ready for promotion 


lh, GCCBA 


GREATER CINCINNAT! COMPENSATION 
AND BENEFITS 1ON 


Practical Applications 


Merit Matrix Model 


BEE sce oasetcaoeosaeaeseotte LEE EE SEE Ee ee ES SSH PE 
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Mie 


Merit Matrix Model 


m Establish merit pay increase guidelines 
m 2 dimensional matrix 
= Performance rating (3 ratings) 
= Position in range (thirds) 
m Determine relationship between cells in matrix 
= Overall merit budget = 3.7% 


Merit Matrix Model 


Position in Salary Range 


1. Set up matri 
aeuiae Tst Third | Middle Third | Upper Third 


Pec | 


Meets 
Below 


Performance 
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Merit Matrix Model 


2. Determine Position in Salary Range 


Relationship 1st Third Middle Third | Upper Third 


a 
oO 
c 
O 
= 
Oo 
ye 
7) 
oO 


Merit Matrix Model 


3. Percent of Position in Salary Range 


Payroll in each cell 1st Third Middle Third | Upper Third 


Meets 45.0% 13.0% 


ow, 


The employees in this 
cell represent 45% of 
the total payroll 


Performance 
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Merit Matrix Model 


4. Relationship x Position in Salary Range 
Payroll 1st Third Middle Third | Upper Third 


Performance 


Really 7.5% times x 


Merit Matrix Model 


5. Solve for x 
3.70% =7.5x+ 14.4x + 5x + 18x +45x + 7.8x 
3.70% = 85.7x 
X= 3.8% 


m Add up each cell and set equal to your 
budget 3.7% 


m@ Solve for x 
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Merit Matrix Model 


6. Determine Cell Position in Salary Range 
increase 1st Third Middle Third | Upper Third 


2.3% 


Meets 4.6% 3.8% 


Performance 


Merit Matrix Model 


7. Alternative range Position in Salary Range 
for each cell 1st Third Middle Third | Upper Third 


5.0% -6.0% | 3.9% - 4.9% | 3.1%-4.1% 
3.9% -4.9% | 3.1%-4.1% | 16%-2.6% 


Performance 
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ilk, GCCBA 


SATION 


N 


Regression Analysis 


Single Regression 


LAA 


DULLER BS ESR ESSER REESE PEEP EASE SEE EERE SEER EEE ES ESSE SUSE EEE SE SESS SES HPSS SESS ES PS Eee 


Single Regression 


m Linear regression is way to ANaneeis « <y varlaple 
. Years of Exp Salary 
build a model 


$20,000 
m Uses least squares 


$22,000 
$21,000 
m Works only on linear data 


$23,300 
$25,000 
$24,500 
$26,000 
$28,000 
$26,500 
$30,000 
$28,000 
$32,000 
$33,000 
$33,500 


m Example: Model salary 
based on years of 
experience 


NO nO hWN = = 


RO jew ak a es 
onan Oo 
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Single Regression 


| $35,000 


$33,000 
| $31,000 


$29,000 + 
27,000 + 


* 
$25,000 $+ 


23,000 


i e 
$21,000 oo 
° 
| $19,000 | 


, $17,000 


15,000 


Single Regression 


Trend Regression type 


ee 
PSF 


Logarithmic Polynomial 


it ei 


Exponential Mowing Average 
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Graph data 


Confirm that it is 
linear 


Click on data point 


Select “add 
trendline” 


@ Select Linear 


Type Optons | m Under Options 


pele dente : a Select display 
fe Automatic: Linear (Salary) : 
Si ao equation on chart 


m Display r-squared 


At 10 years of service: 
Y = 668.21 x 10 + 21,474 
= $28,156.10 


Single Regression 


$40,000 


y = 668.21x + 21474 
R’ = 0.9072 


| $35,000 


| 
| $30,000 


| 
| $25,000 - 


R-squared is the coefficient of 
determination, the amount of 
variation in the pay that can be 
attributed to years of service 


$20,000 


$15,000 
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(blank) 
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Glossary 
Quantitative Methods 


A 


absolute frequency distribution 

A table indicating the number of observed data 
in defined categories. For example, one 
column could contain numeric categories (e.g., 
1-10, 11-20, etc.) or nominal categories such 
as cities (e.g., Richardson, Dallas, etc.), and 
the second column could contain the number 
or quantity in each category. 

[T3, GR2] 


aging survey data 

The practice of increasing market survey data 
by an assumed percentage representative of 
wage movement to bring the data to a 
consistent point in time. This practice also is 
known as “advancing” or “trending” the data. 
[C2, C17, T3, GR2, GR3] 


assets 

What an organization owns as shown on the 
organization’s balance sheet. 

[T2, T3, T6, GR2] 


average 

The sum of all values of a data set divided by 
the number of values in that set. Equivalent to 
the mean. 

[T3, GR2] 


B 


bar chart 


A graphical tool that helps visualize a 
frequency distribution, used with discrete data. 
[T3, GR2] 


benchmark job data 
Market and company data for surveyed jobs. 
[T3, GR2] 


e Course GR2 


bi-modal 
A distribution having two modes. 
[T3, GR2] 


C 


cash flow 

The amount paid out throughout the year 
coupled with the time of payment and all the 
attendant changes. 

[C4, T3, GR2, GR4] 


central tendency 

In statistics, some clustering around a central 
value in a distribution of data usually 
determined by one of the measures of location; 
j.e., mean, median or mode. 

[C2, C17, T3, GR2, GR3] 


central tendency error 

The result when some raters evaluate the 
individual performance of subordinates toward 
the middle of a performance scale, reflecting 
the belief that everyone’s performance is about 
average or that different performance levels 
are not discernible. 

[C11, T3, GR2, GR5] 


coefficient of determination 

The proportion of variation in the dependent 
variable that can be attributed to the 
relationship with another variable or 
combination of variables. It has values 
between 0.0 and 1.0. For simple linear 
regression, it has a value equal to the square 
of the correlation coefficient. 

[T3, GR2] 
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compa-ratio 

The ratio of an actual pay rate (numerator) to 
the midpoint or some other control point for the 
pay range (denominator). Compa-ratios are 
used to measure and monitor an individual’s 
actual rate of pay to the midpoint or control 
point of their range. A compa-ratio can be 
calculated for a group, a department or an 
entire organization. 

[C4, C17, T1, T3, GR1, GR2, GR4] 


compound interest 

The financial return that results when interest 
in succeeding periods is earned not only on the 
initial principal but also on the accumulated 
interest of prior periods. Also referred to as the 
value of money over time. 

[T3, GR2] 


compound salary growth rate (CSGR) 
The annualized percentage rate of growth 
in salary during a specified period of time. 
For example, an employee who was hired 
at $18,000 five years ago and earned 
$32,000 today has experienced a CSGR 

of 12.2 percent. 

[T3, GR2] 


convenience samples 

Drawn from a population based on ease of the 
sampling set. Even though the goal is the 
same as in random sampling (i.e., to represent 
a population) most salary surveys are 
convenience samples. 

(13, GR2] 


correlation coefficient 

A statistical index that measures the strength 
of linear association observed between two 
variables. This index sometimes is referred to 
as the correlation coefficient. Correlation will 
have a value from -1.0 (indicating a perfect 
negative relationship) to +1.0 (indicating a 
perfect positive relationship). A correlation 
coefficient of zero indicates no linear 
relationship at all between two variables. 

[C2, C17, T3, GR2, GR3] 


D 


deciles 
A distribution divided into tenths. 
[T3, GR2] 


denominator 

The number below a fraction line, indicating 
the number of parts into which one whole is 
divided. 

[T3, GR2] 


department compa-ratio 

A tool that compares a department’s total 
salary to its corresponding total midpoints in 
the pay ranges. 

[T3, GR2] 


dependent variable 

Denoted as the y-variable, it is the quantity, 
issue, problem or question that is to be 
explained, understood or predicted. 

[T3, GR2, GR3] 


discrete data 

In statistics, refers to variables that assume 
only specific, distinct values and are not 
continuous. For example, pay grades and 
performance categories are discrete variables 
while height and weight are continuous 
variables. 

[T3, GR2] 


distribution 

(1) Any payment to a participant from a defined 
benefit (DB) plan or defined contribution (DC) 
plan. (2) An ordered array of data. 

[B1, B2, T3, GR2] 


divergent data 

Measurements that vary by orders of 
magnitude or in which the bulk of cases tend to 
cluster at one extreme of a distribution. 

[T3, GR2] 
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= 


earnings per share (EPS) 

The net earnings of an organization divided by 
total shares outstanding, with adjustments for 
common stock equivalents. Sometimes used 
as a criterion in executive incentive pay 
programs. 

[C6, C6A, T11, GR2] 


F 


frequency 

The number of times (absolute frequency) or 
percentage of times (relative frequency) a 
value appears in a data set or in a category. 
[T3, GR2] 


frequency distribution 

A classification of data into mutually exclusive 
categories, with subsequent counting and 
percentage calculations. 

(13, GR2] 


future value 

The estimated value of a sum invested today 
at a specified rate of interest at some future 
point in time. For example, $1 invested for one 
year at an annual rate of five percent has a 
future value of $1.05. 

[T2, T3, T11, GR2] 


G 


gross margin 

A profit measure: sale price minus the cost of 
goods before overhead, profits and taxes. 
Gross margin may be used as a performance 
measure in sales compensation plans. 

[C5, T3, GR2] 


gross margin/gross margin ratio 

Often used as a performance measure; it is 
calculated by subtracting the cost of goods 
sold from net sales. The gross margin ratio is 
obtained by dividing the gross margin by net 
sales. 

[T2, GR2] 


income protection programs 
Programs designed to protect the standard of 
living of the employee and his or her family. 


[B1, B2, B3, B3A, B5, C15, T1, T2, T3, T6, T9, T11, GR1, GR2, GR7, 
GR8] 


independent variable 

Denoted as the x-variable, it is the quantity or 
item that is believed to have an impact on the 
dependent variable. It sometimes is referred to 
as the predictor variable. Often, more than one 
independent variable may impact the 
dependent variable. 

[T3, GR2, GR3] 


inferential statistics 

The set of scientific techniques for using 
statistics derived from a sample of data to 
estimate parameters of the subject population 
based on the mathematical theory of 
probability. 

[T3, GR2] 


intercept 

In a linear model y = a + bx, the intercept is 
denoted by the symbol a and is the value of y 
when x equals zero. Sometimes the intercept 
is also called the constant or the constant term. 
In a multiple linear model, the intercept is the 
value of y when all the x's are zero. 

[T3, GR2] 


interest rate 

The percentage of return for money invested 
over a period of time. 

[T3, GR2] 


interquartile range 

The difference between the 25th percentile 
(first quartile) and the 75th percentile (third 
quartile) in an ordered array of data. This 
range contains the middle 50 percent of the 
data. 

[C4, C15, C17, T3, GR2, GR4] 
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interval measurement 

Measurement in which equal differences on 
the scale represent the same amount, no 
matter the location on the scale. No absolute 
zero exists. 

[T3, GR2] 


L 


least squares line 

In regression analysis, the line fitted to the 
points that minimizes the sum of the squared 
deviations of the points (actual y-values). 

[T3, GR2] 


linear regression 

The statistical technique of fitting a straight line 
to a set of (x,y) data, using the method of least 
squares. Linear regression assumes that the 
basic relationship between the two variables is 
linear in nature. 

[C2, C4, C17, T3, GR2, GR3, GR4] 


linear relationship 

A situation in which the relationship between 
an x-variable and a y-variable can best be 
described by a straight line. The data points 
plotted on a graph of this relationship follow a 
straight line. 

[T3, GR2] 


linear scales 

Lines on which equal distances between 
increments represent equal intervals between 
increments. 

[T3, GR2] 


logarithmic scales 

Lines on which equal distances between 
increments represent equal ratios between 
increments. 

[T3, GR2] 


market index 

An index computed by dividing the pay 
received by an individual by the market pay for 
that job. This figure can be computed for 
groups, departments and the entire 
organization. It is designed to provide a 
measure of how organizational pay compares 
to the market. 

[C2, C4, C17, T3, GR2, GR3, GR4] 


market-based salary increase budget 

A salary increase budget based solely on what 
an organization’s salary goals are with respect 
to the competition. 

[T3, GR2] 


mean 

A simple arithmetic average obtained by 
adding a set of numbers and then dividing the 
sum by the number of items in the set. 

[C2, C5, C17, T3, GR2, GR3] 


median 

The middle item in a set of ranked data points 
containing an odd number of items. When an 
even number of items are ranked, the average 
of the two middle items is the median. 

[C2, C5, C17, T3, GR2, GR3] 


midpoint 
The salary that represents the middle of a 


given salary range or pay grade. 
[C4, C11, C15, C17, T1, T3, T9, GR1, GR2, GR4, GR7] 


midpoint progression 
See midpoint differential. 
[C2, C4, C15, T1, T3, T9, GR1, GR2, GR3, GR4, GR7] 


mode 

The category or value that occurs most 
frequently in a set of observations. In a 
frequency distribution, it is the category with 
the highest frequency. Sometimes there is 
more than one mode. 

[C2, C17, T3, GR2, GR3] 
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model 

A representation of reality that describes 
variables, the relationships among variables 
and the action of the system if changes are 
made in the variables or in the relationships. 
Models are major methods for examining 
alternative strategies and policies. They are 
used to solve problems and to understand 
relationships between data sets. 

[T3, GR2] 


multicollinearity 

In multiple linear regression, a situation where 
there is a high correlation between some of the 
x-variables, or where one x-variable is close to 
a linear combination of other x-variables. 
Multicollinearity occurs when one x-variable 
starts to become a surrogate for another x- 
variable or for a linear combination of other x- 
variables. When multicollinearity exists in a 
statistical inference situation, the model is not 
as stable as one would prefer. 

[C15, T3, GR2, GR8] 


multiple coefficient of determination 


In a multiple regression situation, the 
proportion of variation in the dependent 
variable that can be attributed to the 
relationship with the linear combination of the 
independent variables. It has values between 
0.0 and 1.0. 

[T3, GR2] 


multiple correlation coefficient 

In a multiple regression situation, a measure of 
how well data points fall on a straight line in 
multidimensional space. It has a value equal to 
the square root of the multiple coefficient of 
determination. 

[T3, GR2] 


multiple regression 

The statistical technique of creating a model of 
a y-variable (dependent variable) as a function 
of more than one x-variable (independent 
variable) using the method of least squares. It 
allows an assessment of the joint impact of 
several x-variables on the y-variable. 

[C2, C17, T3, GR2, GR3] 


N 


natural logarithm 

A logarithm that uses a base of e, a natural 
number approximately equal to 2.71828. 
[T3, GR2] 


negative correlation 

A relationship where higher values of y 
generally go with lower values of x, and lower 
values of y generally go with higher values of x. 
[T3, GR2] 


negatively skewed 

A distribution in which most of the scores 
cluster to the right of the mean and the “tail” of 
the distribution trails off to the left. The mean is 
smaller than the median, which in turn is 
typically smaller than the mode. 

[T3, GR2] 


no correlation 

A relationship in which no linear pattern is 
observed among variables. 

[13, GR2] 


nominal measurement 

Measurement that uses numbers merely as 
labels or identifiers (e.g., numbering of 
companies or numbering of jobs in a salary 
survey). 

[T3, GR2] 


nonlinear regression 

A form of statistical analysis that develops a 
model based on nonlinear or curvilinear 
relationships between variables. 

[C2, C17, T3, GR2, GR3] 


nonrandom sampling 

A process by which samples are drawn without 
using some form of randomized selection 
procedure. Findings are less statistically 
generalizable than those from random 
samplings. 

[T3, GR2] 
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nonsymmetric distribution 

A set of plotted data points in which one side is 
not the mirror image of the other. Graphed 
points tend to cluster more on one side than 
the other, leaving a “tail” on one side. 

[T3, GR2] 


normal distribution 

A particular bell-shaped distribution, used 
frequently as a basis for statistical inference. 
The fact that many naturally occurring 
phenomena are distributed normally (e.g., 
height) does not mean that organizational 
phenomena are. Many salary and performance 
distributions are non-normal. 

(13, GR2] 


numerator 

The number above a fraction line, indicating 
the number of parts of the whole. 

[T3, GR2] 


O 


ordinal measurement 

Measurement that indicates the order, or rank, 
of the items being measured. 

[T3, GR2] 


GB 


pay policy 

The desired position of an organization’s pay 
with respect to competitive pay at a certain 
point in time. 

[T3, GR2] 


pay range 

The range of pay rates, from minimum to 
maximum, established for a pay grade or 
class. Typically used to set individual 
employee pay rates. 

[C2, C4, C11, C15, C17, T1, T9, GR1, GR2, GR3, GR4, GR7] 


pay-range overlap 

The degree to which the pay ranges assigned 
to adjacent grades in a structure overlap. 
Numerically, the percentage of overlap 
between two adjacent pay ranges. 

[C4, C17, T3, GR2, GR4] 


percent 

A ratio expressed in relation to 100 that is 
obtained by dividing one number into another 
and then multiplying by 100. The sum of all 
percents related to a total amount must equal 
100 percent. 

[T3, GR2] 


percent change 

A difference in some variable over time 
expressed as a percentage of the value of that 
variable at the initial time. For example, the 
percent change in a wage rate that grows from 
$10 to $12 is 20 percent. 

[T3,, GR2] 


percent difference 

The percent that the value of one item differs 
from a referenced item. In compensation 
decision-making, the data on which action will 
be taken should be the referenced item and, as 
such, should be in the denominator of the 
calculation. That way, the percent difference 
will be expressed in terms of the reference. For 
example, market adjustment is equal to 100 x 
(market data — company data) + company data. 
[C4, T3, GR2, GR4] 


percentile 

A measure of location in a distribution of 
numbers that defines the value below which a 
given percentage of the data fall. For example, 
the 90th percentile is the point below which 

90 percent of the data fall. 

[C2, C17, T3, GR2, GR3] 


percentile bar 

A graphic device used to summarize data that 
shows percentiles of common interest in the 
data graphically. 

[T3, GR2] 
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polynomial model 

A model that can be expressed with powers of 
the x-variable. The order of the model is the 
value of the highest exponent of x. A straight 
line or linear model is a first-order polynomial: 
y = a+ bx. A quadratic model is a second- 
order polynomial: y = a + bx + cx?. A cubic 
model is a third-order polynomial: y = a + bx + 
cx? + dx*. Polynomials of order higher than on 
are used to describe nonlinear relations. 

[GR2] 


population 

In statistics, the set of all elementary units of 
interest in a given situation. 

[T3, GR2] 


positive correlation 

A relationship where higher values of y 
generally go with higher values of x, and lower 
values of y generally go with lower values of x. 
[T3, GR2] 


positively skewed 

A distribution in which most of the scores 
cluster to the left of the mean and the “tail” of 
the distribution trails off to the right. The mean 
is larger than the median, which in turn is 
larger than the mode. 

[T3, GR2] 


present value 

The value of an amount of money at the 
present time or at the beginning of a specific 
period of time. The concept recognizes that 
receipt of a dollar in five years is less desirable 
than receipt of a dollar today. Present value 
formulas take into account assumptions about 
interest rates to produce an estimate of the 
current worth of the guaranteed delivery of a 
dollar at some specified point in the future. 
Present value formulas and variants are used 
in determining funding requirements for several 
compensation and benefits programs. 

[C6, C6A, T2, T3, T6, T11, GR2] 


Q 


quartile 

A distribution divided into fourths. The first 
quartile corresponds to the 25th percentile, the 
second to the 50th percentile, the third to the 
75th percentile and the fourth to the 100th 
percentile. 

[C11, T3, GR2] 


quintiles 
A distribution divided into fifths. 
[T3, GR2] 


R 


random samples 

Drawn from a population in which every 
possible sample of that size has an equal 
chance of being selected. 

[T3, GR2] 


random sampling 

A process by which subsets of a population are 
drawn using a selection procedure in which 
every member of the population has an equal 
chance of being chosen. Common procedures 
to obtain random samples include using a table 
or random-number generator, or drawing 
numbers from a hat. 

[T3, GR2] 


range 

(1) For a set of data, the difference between 
the maximum value and the minimum value. 
(2) For a pay grade, the percentage by which 
the maximum pay exceeds the minimum. 

[C4, C17, T3, GR2, GR4] 


ratio 

With ratio scales a true zero point exists; 
measurements are meaningful and indicate 
how much of the quantity is being measured. 
[T3, GR2] 
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ratio measurement 

Measurement that indicates how much, or the 
quantity, of the item or attribute being 
measured. There is a true zero, where zero 
indicates nothing of what is being measured. 
Examples include number of employees, 
salaries, sales volume and production rates. 
Also known as cardinal measurements. 

[T3, GR2] 


regression analysis 

The statistical technique of creating a model of 
a y (dependent) variable as a function of one 
or more x (independent) variables using the 
method of least squares. 

[C2, C4, C17, T3, GR2, GR3, GR4] 


relative frequency distribution 

A table indicating the relative amounts of data 
in defined categories. One column contains the 
categories (numeric such as 1-10, 11-20, etc., 
or nominal such as Richardson, Dallas, etc.), 
and the second column contains the percent or 
proportion in each category. 

[T3, GR2] 


return on assets (ROA) 

The ratio of net income to the total assets of an 
organization. A key financial measure, 
sometimes used as a criterion in incentive 
plans. 

[C6, C6A, C11, C12, T2, T11, GR2, GR6] 


return on equity (ROE), return on 
shareholder equity (ROSE) 

The ratio of the net income attributable to 
common shares to shareholders’ equity. Often 
used as a measurement in incentive plans. 

[C6, C6A, C11, T2, T11, GR2, GR5, GR6] 


return on sales (ROS) 

The ratio of net income to net sales. A key 
financial measure, sometimes used as a 
criterion in incentive plans. 

[C6, C6A, C12, T2, GR2, GR6] 


S 


salary increase budget 

The difference between the values of an 
organization’s current pay and its pay goals. 
[T3, GR2] 


salary range 
See pay range. 
[C2, C11, C17, GR2, GR3] 


scattergram 

A mathematical technique of displaying a 
“picture” of a relationship between two 
variables by plotting (x, y) points. Also called 
a scatterplot. 

[T3, GR2] 


shapes of distribution 

The appearance of the placement of data 
values when graphed. 

[T3, GR2] 


skewed distribution 

A set of plotted data that is nonsymmetric, with 
a tail of extreme values in one direction. 

[T3, GR2] 


slope 

In a simple linear model, y = a + bx, the 
coefficient b is the change in the value of y, the 
dependent variable, for each unit change in the 
value of x, the independent variable. 
Graphically, slope represents the “rise” over 
the “run” of a given line and directly determines 
a line’s steepness. In a multiple linear model, 

y =at bixs + box2 +... + DkxXk, each xi has a 
slope bi, which is sometimes called a b-weight, 
partial regression weight or regression 
coefficient. Each b; is the change in y for each 
unit change in the corresponding x; when all 
the other xs are held constant. 

[C2, C4, C11, C12, C17, T3, GR2, GR6] 
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standard deviation 

The square root of the average squared 
difference between data points and the mean. 
Standard deviation is a measure of variability 
which indicates an average relative distance 
between each data point and the mean. The 
larger the standard deviation, the more the 
data is spread out from the mean. 
Mathematically, it is the square root of the 
variance. 

[T3, GR2] 


survey 

The gathering of information about a situation. 
Often, surveys consist of sampling data from a 
population. Examples include a benchmark 
salary survey that collects pay data for 
benchmark jobs from a defined labor market, a 
maturity salary survey that collects both pay 
and experience data from a defined labor 
market for benchmark jobs or jobs in a given 
discipline at a given degree level, and a 
benefits survey that collects benefits data from 
a defined labor market. 

[C2, C4, C15, C17, T3, T9, GR2, GR4, GR7] 


symmetric distribution 

A plotting of data in which one half is the mirror 
image of the other half. The point of symmetry 
is the median. 

[T3, GR2] 


T 


total rewards 


The monetary and non-monetary returns 
provided to employees in exchange for their 
time, talents, efforts and results. Total rewards 
involve the deliberate integration of five key 
elements that effectively attract, motivate and 
retain the talent required to achieve desired 
business results. 


[B1, B2, B3, B3A, B5, C1, C2, C3, C4, C5, C6, C6A, C11, C12, C15, C17, 
T1, T2, T3, T4, T6, T9, 711, 712, W1, W2, W3, GR1, GR2, GR3, GR4, 
GR5, GR6, GR7, GR8, GR9] 


trimmed mean 

A process used to eliminate outliers to 
determine the true mean of a data set. 
[T3, GR2] 


tri-modal 
A distribution having three modes. 
[T3, GR2] 


true zero 

Where the value of zero indicates an absence 
of what is being measured. 

[T3, GR2] 


U 


uniform distribution 

A symmetrical distribution in which the 
frequency in each interval is about the same; it 
looks “flat” when graphed. 

[T3, GR2] 


uni-modal 
A distribution having only one mode. 
[T3, GR2] 


unweighted mean 

A simple arithmetic average of individual 
means. 

[C2, C17, T3, GR2, GR3] 


V 


validity 

The quality of a measuring device that refers to 
its relevance (i.e., is the device actually 
measuring what it is intended to measure’). 
This quality is extremely important for job 
analysis and job evaluation measures as well 
as for performance appraisal devices. 

[C2, C11, C17, T3, GR2, GR3, GR5] 


variable 

A measured characteristic of an entity that can 
have a number of different values. Two 
aspects of variables are important: 1) the 
measured characteristic is of interest and 2) 
the measured characteristic varies. 

[T3, GR2] 
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variance 

A measure of dispersion (in interval or ratio 
data) that is defined as an average of the 
squared deviation of each data point from the 
mean. It is used to calculate the standard 
deviation. 

[T3, GR2] 


variation ratio 

A measurement used in statistics to represent 
how dispersed the data is. Refers to the 
percent of items that are not included in the 
mode. 

[T3, GR2] 


W 


weighted mean 

An average of means calculated by weighting 
each individual mean according to the number 
of data points that made up that individual 
mean. 

[C17, T3, GR2, GR3] 


Z 


z-score 

The number of standard deviations an item is 
above or below the mean of the set of data to 
which it belongs. It is calculated by subtracting 
the mean from the item and then dividing the 
difference by the standard deviation. 

[T3, GR2] 
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Faculty * Course GR2 


Atsmon, Tali 


Atsmon is manager of Atsmon Group - 
Advanced Compensation Solutions in Israel. 


Brown, David L., CCP, CBP 


Brown is director of compensation for Esco. 

He has more than 25 years' experience in 
compensation and benefits and has worked for 
several Fortune 500 companies. He earned his 
BS from Washington State University and an 
MBA from Portland State University. 


Buergi, Andreas, GRP 


Buergi is a managing founding partner of the 
Living Water Institute located in Switzerland. 
He has an MBA in organizational leadership 
from City University in Bellevue, Wash. 


Ceron, Mario 


Ceron is director of Peoplematters in Madrid, 
Spain. 


Hovy, Maureen, GRP 


Hovy is rewards and benefits manager, AOA 
with Huhtamaki Australia Pty Ltd and has 
specialized in rewards consulting since the late 
1980s. Her areas of expertise include base pay 
and incentive design, global rewards strategies 
and advice at board and executive levels. She 
holds a BS from the Rand Afrikaans University, 
as well as honors in education and a diploma 
in data metrics (with distinction) from the 
University of South Africa. 


Jauhal, G. Peter 


Jauhal is a consultant with Hewitt Associates in 
the U.K. He has an MA in mathematics from 
Cambridge University and is also a Fellow of 
the Institute of Actuaries. During the last twenty 
years he has specialized in the field of 
compensation and benefits consulting and has 
advised organizations from all over the world. 
He currently specializes in the design and 
implementation of short- and long-term 
incentive programs. 


Kruger, Peet 


Kruger is senior manager with Reward 
Partnership in South Africa. 


Marcos, Susana 


Marcos is a partner with Peoplematters in 
Madrid, Spain. 


Parent, Nathalie, CCP, CHRP 


Parent is director of HR Europe for Cognos Ltd 
in the United Kingdom. 


Peacoe, Ray W., CCP, CBP, GRP 


Peacoe is president of The Human Resource 
Consultancy specializing in human resource 
strategy, compensation and benefits. He holds 
a BS from Westminster College and an MS 
from Indiana University. He is a former 
member of the WorldatWork board of directors. 


Pillai, Ashok 


Pillai is head of rewards management for 
BP International in London. 
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Rimsky, Tolo M., CCP, GRP 


Rimsky is a principal at the Center for 
Workforce Effectiveness and specializes in 
compensation consulting and other issues in 
Latin America. He holds a BS and an MS in 
engineering from Case Institute of Technology/ 
Case Western Reserve University. 


Robins, Clyde J. CBP, CPA 


Robins is vice president of human resources at 
Nutraceutical International Corp. He holds a 
BA in accounting from Brigham Young 
University. Robins is licensed in health, life and 
disability insurance and is a Certified Public 
Accountant. 


Roeser, Douglas P., CCP, CBP, GRP 


Roeser is assistant professor at Sierra Nevada 
College. He holds BA and MBA degrees from 
the University of North Carolina at Chapel Hill. 


Rubino, John A., CCP, CBP, GRP 


Rubino is president of Rubino Consulting 
Services (RCS) in Pound Ridge, N.Y. 

His areas of expertise are in the design and 
implementation of executive and employee 
incentive plans, performance management 
systems, management training programs, 
motivational training and human resources 
communications. He holds a BA, with honors, 
from Wagner College and an MBA, with 
distinction, from Pace University. 


Ryder, Averile, GRP 


Ryder is the founding member of Averile Ryder 
Remuneration & Outsourced Payroll 
Consulting, which specializes in developing 
effective human resource and remuneration 
solutions. Averile has been in the human 
resources field for over 25 years and holds an 
Advanced Diploma in Human Resources. She 
is Chairperson for the SARA KZN Committee 
and Vice Chairperson on the Durban Chamber 
Human Resources Committee. 


Seltz, Steven P., CCP 


Seltz is senior director of compensation and 
benefits for Siemens Corp. He holds a BA from 
Vassar College and an MBA from the Wharton 
School at the University of Pennsylvania. Seltz 
has served as a member of WorldatWork 
board of directors. 


Sukovaty, Charles S., CCP, CBP, SPHR 


Sukovaty currently works for Mercer Human 
Resource Consulting. He holds a bachelor's 
degree in human relations from Doane College. 
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WorldatWork Society of 
Certified Professionals. 


Certification Examination 


Information 


About WorldatWork Society of Certified Professionals® 


WorldatWork Society of Certified Professionals is a not-for-profit organization dedicated to providing 
certification in the human resources disciplines of compensation, benefits and total rewards. The Society 
issues the Certified Compensation Professional (CCP®), the Certified Benefits Professional® (CBP) and 
the Global Remuneration Professional (GRP®) designations. It also serves as an alumni society for CCP, 
CBP and GRP certified professionals. The organization is an affiliate of WorldatWork, the professional 
association dedicated to knowledge leadership in compensation, benefits and total rewards. 


www.worldatworksociety.org 
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Certification Examination Information 


Development 


Internal and external subject matter experts 
contribute to the development of the content 
of WorldatWork courses as well as the 
examinations. Exam questions are reviewed 
by an internal examination team to ensure that 
items are properly scripted and meet the 
standards of good measurement practices. 
Exam reviews consist of, but are not limited 
to, checking items for appropriate content, 
accuracy, and for clues that would help the 
person taking the exam answer the item 
correctly. For new courses, a pilot of the 
course is conducted, culminating with an 
examination. 


The final step in the examination development 
process occurs when scores are received, 
indicating how well the exam performed. 
Subsequently, exams continue to receive 
regular reviews to assess performance and 
currency. 


Specifications 


Exam specifications serve as a guide for the 
development of WorldatWork examinations 
and consist of two parts: (1) the content or 
knowledge assessed by the test, and (2) the 
statistical analysis of the items of the test. 


The content or knowledge contained in the 
examinations is derived by a comprehensive 
breakdown of the material covered in each 
course, based on the Body of Knowledge. 
Exams are developed and refined on an 
ongoing basis by teams of staff and 
volunteers with expertise in each subject area. 
Questions assess mastery of the topics 
covered in the course books as the materials 
relate to the Body of Knowledge. Examination 
developers weigh the amount of key 
information presented in each module. 


Examination items cover the overall Body of 
Knowledge presented in the course, and 
reflect relative emphasis required in each 
module. Each test item is written based upon 
the module objectives and content to measure 
specific types of knowledge, such as recall, 
application, analysis, evaluation, etc. 


Time allotted for completion of each 
examination is four and one-half hours; 
however, most participants complete 
examinations within one and one-half to 
two hours. 


An item-by-item review of test items is 
performed to ensure that all test items meet 
the prescribed criteria. Each examination item 
is analysed to determine exactly where it is 
presented in the course material. The analysis 
is conducted on an item-by-item, module-by- 
module basis. All examination items are 
developed with this balance in mind. 
Examination items must be clear and 
straightforward to reduce confusion and 
frustration, which would limit participants’ 
opportunity for success. 


Test items are subject to continual scrutiny by 
statistical analysis as well. Comprehensive 
statistical analysis is conducted on each exam 
by Arizona State University’s Testing Center, 
which provides ongoing validity and reliability 
information. Exam results are reported to 
students electronically, or by mail, upon 
request. Proprietary transcript records are 
maintained at headquarters. 


Each examination: 


1. Reflects course objectives, course 
materials and overall presentation to 
maintain content validity. 


2. Provides a balanced sequence of exam 
items according to class presentation 
and objectives. 


3. Contains test items with the appropriate 
number of alternative distractors to 
accurately determine discrimination. 


4. Is monitored using an item analysis for 
evaluating internal consistency and 
reliability. 


5. Serves as a measure of knowledge 
of course content and the Body of 
Knowledge. 
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The monitoring process consists of an 
ongoing review of the following factors: 


1. The security of examinations. 


2. The maturity, currency and relevancy of 
the examination over time. 


3. A comprehensive item analysis, utilizing 
statistical procedures that include item 
discrimination and item difficulty 
assessment. 


4. Amonitoring of all items to identify those 
that consistently perform poorly on item 
analysis criteria and should be eliminated 
or revised. 


5. The implementation of adjustments 
necessary in the scoring base to reflect 
the integrity, fairness, and credibility of 
the examination. 


Certification examinations are designed to 
measure a specified level of proficiency 
relative to course content. This serves to 
link the exams to the associated Body of 
Knowledge. 


The minimum passing score (cut score) for 
all certification exams is 75 per cent. By 
definition, this serves as the accepted 
threshold of mastery. 


Item Type and 
Number of Options 


Multiple-choice questions are used for 
certification exams. This type of question 
was chosen for the following reasons: 


1. Multiple-choice questions are easily 
adapted to meet a variety of testing 
situations. 


2. Psychometric research indicates that 
multiple-choice questions are more 
reliable than other types of questions. 
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3. Multiple-choice questions can assess a 
wide range of knowledge, skills and 
abilities. 


4. Multiple-choice exams can be scored 
electronically. 


Based on research and contemporary 
measurement practice, the number of options 
(responses to the questions, i.e., A, B, C) 
ranges from 2 to 4. 


The following are the three components of 
multiple-choice questions: 


1. Stem: An opening or introductory 
statement, typically in the form a question. 


2. Options: Each choice (A, B, C) is an 
option. There is one BEST option, which is 
the correct response (which would be the 
consensus of a panel of experts). 


3. Distractors: The incorrect choices are the 
distractors. Distractors are believable 
responses; however wrong, or simply not 
the BEST response. 


Exam Versions 


WorldatWork Society of Certified 
Professionals updates exams regularly. 

The examination policy is to administer exams 
based on the most current content in the 
profession. Exam versions maintained in 
inventory assure individuals purchasing 
course binders that the content assessed on 
the exams will be adequately covered in the 
course binders for one year after the date of 
binder purchase or course registration. If 
more than one year has elapsed since binder 
purchase or course registration, it is possible 
that some questions on the exam may not be 
covered in the course materials. To view the 
current course outline/description online go to 


www.worldatwork.org. 
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Exam-Taking 
Suggestions 


Examination anxiety is something that many 
people experience. The information contained 
in this document is intended to provide you 
with sufficient information to lessen possible 
anxiety that you may experience. 


The following suggestions are offered to help 
you perform to the best of your capabilities 
and knowledge: 


1. Trust your first instinct. 


2. Avoid over analysing. Caution against 
reading too much into the questions. 


3. When in doubt, move on and return. 
4. Proceed through the test in order. 


5. Make educated guesses. If you are 
fluctuating between one or two choices, 
guess at one. There is no penalty for 
guessing. 


6. Use the margins of the exam booklet. 
Utilize the spaces available on the exam 
booklet to make any calculations, make 
notes and to note any questions that you 
may want to skip over and return to later. 


7. Review your answers. When finished with 
the test, scan it to ensure that you have 
provided an answer for each question. 


8. Use time wisely. There is adequate time 
to complete the test. Proceed through 
the test at a pace that is comfortable for 


you and that will allow you time to review 


and to check your answers. 


Fair Testing 
Practices 


Attention is focused throughout the 
examination development process on fair 
testing practices. Some of the factors that 
are included in this evaluation process are 
item writing and reviewing, item banking, 
item selection and examination review. 


The following support the efforts for fair 
testing: 


1. Exam developers are provided training in 
item writing. 


2. Examinations undergo reviews for any 
potential bias. 


3. Item performance is reviewed, including 
the instructional sensitivity of the item 
(the item needs to be difficult for those 
untrained and easy for those trained). 
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